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PREFACE 


Prediction  problems  frequently  arise  in  which  the  regression  weights 
must  be  based  on  a  relatively  small  number  of  criterion  observations.  In  such 
cases,  current  techniques  permit  the  utilization  of  only  a  very  few  predictors, 
even  though  many  more  may  be  available.  Unless  one  or  more  of  the  pre¬ 
dictors  is  closely  related  to  the  criterion,  accurate  predictions  cannot  be  made. 
The  possibility  of  increasing  the  accuracy  of  prediction  under  such  circum¬ 
stances  through  the  use  of  reduced-rank  methods  is  investigated  in  this  study. 

On  the  basis  of  normal  regression  theory,  a  general  reduced-rank  model 
is  formulated  in  terms  of  prediction  from  factor  scores.  The  problems  of 
selecting  a  method  of  factoring,  of  selecting  an  optimal  subset  of  prespecified 
size  from  among  a  given  set  of  factors,  and  of  selecting  an  optimal  rank  are 
considered.  It  is  shown  that  in  the  absence  of  criterion  observations,  the 
optimally  chosen  reduced-rank  solution  will  be  the  one  that  accounts  for  the 
greatest  proportion  of  variance  in  the  full-rank  predictor  matrix.  Prediction 
either  from  subsets  of  the  original  predictors,  which  are  equivalent  to  tri¬ 
angular  factors,  or  from  principal-axes  factors  is  considered.  It  is  concluded 
that,  when  degrees  of  freedom  are  sufficiently  limited,  the  most  accurate 
predictions  obtainable  will  be  those  based  on  the  largest  principal-axes  factors. 
As  a  tentative  solution  to  the  problem  of  optimal  rank,  estimates  are  derived 
which  are  intended  to  indicate  the  accuracy  of  prediction  to  be  expected 
when  regression  weights  computed  on  the  basis  of  data  in  one  sample  are 
applied  to  data  in  other  samples. 

An  empirical  comparison  of  five  reduced-rank  methods  is  carried  out, 
employing  a  variety  of  ranks,  sample  sizes,  and  criteria.  The  five  methods 
include  prediction  from  the  principal-axes  factors,  selected  in  three  different 
ways,  and  from  the  original  predictors,  selected  in  two  different  ways.  The 
results  indicate  that  weights  computed  by  the  method  of  largest  principal- 
axes  factors  on  samples  with  as  few  as  30  cases  can  give  predictions  as  accurate 
as  those  from  weights  computed  by  conventional  techniques  on  samples  of 
several  hundred  cases. 

The  present  monograph  was  submitted  as  a  doctoral  dissertation  at  the 
University  of  Washington  in  July  1962.  The  writer  wishes  to  thank  his 
sponsor,  Professor  Paul  Horst,  for  the  invaluable  blend  of  criticism  and 
encouragement  that  he  provided.  The  work  for  the  present  monograph  was 
largely  supported  by  Office  of  Naval  Research  Contract  Nonr.  477(33)  and 
Public  Health  Research  Grant  M-743(C7)  (principal  investigator:  Paul 
Horst).  Acknowledgment  is  due  Mrs.  Judy  Goodstein  and  Mrs.  Helen  Ranck 
for  their  work  in  typing  and  proofreading  the  manuscript. 

George  R.  Burket 

Pittsburgh,  Pennsylvania 
October,  1963 
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CHAPTER  1 


INTRODUCTION 

Basic  Requirements 

Accurate  predictions  of  an  individual’s  degree  of  success  or  failure  in 
such  socially  significant  activities  as  a  college  course,  training  for  some  voca¬ 
tion,  or  a  particular  job  would  be  of  incalculable  utility,  both  to  the  individual 
concerned  and  to  the  community.  Remarkably  accurate  predictions  of  this 
nature  can  be  obtained  with  existing  statistical  techniques,  provided  that  two 
basic  requirements  are  satisfied.  1  irst,  there  must  be  measurements  available 
on  a  number  of  variables  related  to  performance  in  the  activity  of  interest. 
It  must  be  possible  to  obtain  these  measurements  on  any  individual  before 
he  engages  in  the  activity.  Second,  such  measurements  must  be  obtained  for 
a  large  number  of  persons  who  subsequently  engage  in  the  activity. 

The  first  requirement  can  almost  always  be  mot.  Indeed,  it  is  usually 
possible  to  find  many  variables  having  at  least  some  relation  to  performance 
in  the  criterion  activity.  To  obtain  measurements  on  a  large  number  of 
variables  may  be  expensive,  but  accurate  predictions  of  many  activities  are 
of  sufficient  value  to  warrant  large  expenditures.  The  second  requirement  is 
much  less  likely  to  be  satisfied,  since  the  number  of  persons  who  actually 
engage  in  a  particular  activity  is  often  limited.  This  is  particularly  true  for 
activities  requiring  an  unusual  degree  of  ability,  where  accurate  predictions 
are  apt  to  be  most  desired.  Many  socially  significant  activities  are  full-time 
occupations  which  individuals  must  pursue  for  years  before  their  success  or 
failure  can  be  determined.  If  the  number  of  persons  engaging  in  such  an 
activity  is  too  small  to  permit  application  of  existing  techniques,  no  feasible 
expenditure  will  yield  accurate  predictions.  Wc  need  new  techniques. 

The  Statistical  Model 

V  system  for  obtaining  the  best  possible  predictions  for  a  given  criterion 
would  be  the  following.  First,  determine  all  variables,  termed  predictors,  not 
statistically  independent  of  the  criterion.  Then  obtain  measurements  of  pre¬ 
dictors  and  criterion  on  a  sufficiently  large  validation  sample  so  that  ever}'  pos¬ 
sible  configuration  of  predictor  values  is  represented  by  a  large  number  of 
cases.  Compute  the  criterion  mean  for  each  of  these  configurations.  To  make  a 
prediction  for  a  particular  case,  determine  the  configuration  of  the  predictors 
for  that  case.  The  prediction  will  be  the  criterion  mean  for  cases  in  the  valida¬ 
tion  sample  having  that  configuration. 
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Such  a  system  is  unworkable  because  of  practical  limitations  on  sample 
size  and  number  of  predictors.  Under  certain  circumstances,  moreover,  a 
much  simpler  system  could  give  equally  accurate  predictions.  If,  for  example, 
the  criterion  means  were  known  to  be  functionally  related  to  the  predictors, 
it  would  only  be  necessary  to  determine  this  function.  In  practice,  such  a 
functional  relation  is  virtually  always  assumed.  It  may  also  happen  that  a 
small  subset  of  all  variables  statistically  related  to  the  criterion  will  give  pre¬ 
dictions  as  accurate  as  the  entire  set.  Even  where  a  very  large  number  of 
independent  predictors  is  readily  available,  the  number  that  may  actually  be 
used  is  limited  by  the  available  sample  size.  This  is  because  it  is  necessary 
to  have  many  more  cases  than  there  arc  parameters  in  the  assumed  functional 
relation  between  predictors  and  criterion  mean.  Otherwise  one  could  not 
obtain  stable  estimates  of  these  parameters. 

In  least-squares  or  regression  theory  and  also  in  correlation  theory,  the 
mean  of  the  criterion  is  assumed  1o  be  a  linear  function  of  the  predictors.  In 
correlation  theory,  predictors  and  criterion  are  assumed  to  be  random  vari¬ 
ables  having  a  joint  multivariate  normal  distribution.  In  regression  theory, 
the  criterion  is  assumed  to  be  a  normally  distributed  random  variable,  while 
the  predictors  are  thought  of  as  being  fixed.  Anderson  (1958,  p.  61)  recom¬ 
mends  using  one  model  or  the  other  depending  on  whether  or  not  the  predictors 
may  be  considered  random.  Mood  (1950,  p.  312)  states  that,  in  practice,  most 
correlation  problems  can  be  more  appropriately  handled  by  regression  meth¬ 
ods.  In  many  cases,  the  two  models  have  led  to  equivalent  procedures;  under 
the  null  hypothesis,  estimates  of  regression  weights,  test  criteria,  and  prob¬ 
ability  theory  are  all  the  same.  However,  when  the  null  hypothesis  (viz.,  that 
predictors  and  criterion  arc  independent)  is  not  true,  the  probability  theory 
differs. 

In  prediction  problems  in  psychology,  the  predictor  variables  arc  generally 
random  rather  than  fixed,  and  the  null  hypothesis  is  rarely  true.  Thus  cor¬ 
relation  theory  would  appear  to  be  more  appropriate.  However,  since  correla¬ 
tion  theory  is  considerably  more  complex  and  difficult  to  apply  than  regression 
theory,  the  latter  is  generally  used,  with  the  hope  that  the  practical  differences 
between  conclusions  drawn  from  the  two  models  will  be  negligible.  In  the 
present  study,  prediction  problems  will  for  the  most  part  be  considered  within 
the  context  of  regression  theory. 

It  may  prove  useful  at  this  point  to  make  the  distinction  between  actual 
prediction  problems  and  validation  problems.  In  validation  problems,  the 
goal  is  to  demonstrate  a  systematic  relationship  between  a  number  of  “inde¬ 
pendent  variables"  and  a  “dependent  variable.”  To  accomplish  this,  one 
formulates  the  null  hypothesis  of  no  relationship  and  hopes  to  reject  it  at 
some  level  of  confidence.  Thus,  for  validation  problems,  correlation  theory 
and  regression  theory  are  equivalent.  In  prediction  problems,  on  the  other 
hand,  the  null  hypothesis  is  assumed  to  be  false.  The  goal  is  to  obtain  a 
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regression  equation  which,  when  applied  to  predictor  measures  in  future 
samples,  will  give  the  most  accurate  estimate  possible  of  the  corresponding 
criterion  values.  Having  obtained  such  a  regression  equation,  one  would  also 
wish  to  have  estimates  or  confidence  intervals  indicating  the  accuracy  to  be 
expected  when  the  regression  equation  is  applied  to  new  samples.  In  valida¬ 
tion  problems,  the  multiple  con-elation  is  often  used  as  a  measure  of  relation¬ 
ship  between  the  dependent  and  independent  variables.  It  is  sometimes 
termed  a  validity  coefficient,  or  simply  a  validity.  In  prediction  problems, 
the  correlation  between  the  prediction  and  the  criterion  in  new  samples  may 
be  used  as  a  measure  of  aecuracy  of  prediction.  Such  a  coefficient  may  be 
termed  a  weight-validity  to  distinguish  it  from  the  multiple  correlation 
coefficient  between  the  prediction  battery  and  the  criterion  in  the  original 
sample. 

Purpose  of  the  Study 

The  present  study  is  concerned  with  prediction  problems  as  opposed  to 
validation  problems.  Regression  theory  in  its  current  form  is  adequate  for 
those  applications  in  which  the  available  number  of  cases  far  exceeds  the 
available  number  of  predictors,  i.e.,  in  which  the  number  of  degrees  of  free¬ 
dom  is  large.  In  such  cases,  weight-validity  will  be  very  close  to  battery 
validity,  and  the  least-squares  estimates  of  the  regression  weights  will  provide 
optimal  predictions.  But  when  the  number  of  predictors  available  is  relatively 
large  in  relation  to  sample  size,  as  is  perhaps  more  often  than  not  the  case, 
problems  arise  that  lack  satisfactory  theoretical  answers.  One  such  problem 
is  that  of  estimating  an  index,  such  as  weight -validity,  that  will  provide  some 
idea  of  the  accuracy  of  prediction  to  be  expected  in  new  samples.  A  more 
important  problem  is  that  of  determining  the  regression  weights  which  will 
give  the  most  accurate  predictions  possible  in  new  samples. 

These  optimal  weights  will  not  in  general  be  given  by  the  conventional 
least-squares  solution  applied  to  all  available  predictors.  For  example,  if 
the  number  of  predictors  is  the  same  as  the  number  of  cases  in  the  sample,  the 
least-squares  weights  for  an  arbitrary  subset  of  predictors  wall  usually  give 
better  weight-validity  (though  lower  validity)  than  the  weights  for  the  entire 
set.  More  generally,  in  such  an  extreme  case,  any  lower-rank  approximation 
to  the  matrix  of  predictor  values  would  give  better  predictions  than  the 
complete  matrix.  As  the  situation  becomes  less  and  less  extreme,  there  must 
come  a  point  where  some  ranks  and  some  methods  of  rank  reduction  and  not 
others  are  preferable  to  the  complete  matrix.  At  a  still  less  extreme  point,  the 
entire  set  of  predictors  will  presumably  give  better  predictions  than  any 
reduced-rank  approximation.  Still,  when  predictors  are  discarded,  the  loss  of 
accuracy  of  prediction  may  be  so  slight  as  to  be  more  than  offset  by  the  prac¬ 
tical  savings  of  not  having  to  measure  as  many  predictors. 

Thus  in  any'  prediction  problem  where  the  number  of  degrees  of  freedom 
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is  limited,  the,  question  of  rank  reduction  arises:  can  the  complete  predictor 
matrix  be  improved  upon,  and  if  so,  which  method  of  reduction  and  nhich 
rank  will  give  the  greatest  improvement?  When  its  purpose  is  to  give  more 
accurate  prediction  by  increasing  degrees  of  freedom,  the  much-studied 
predictor  selection  problem  is  a  special  case  of  the  rank-reduction  problem. 
Predictor  selection  methods  are  more  often  used,  however,  in  situations  where 
an  upper  limit  on  the  size  of  the  prediction  battery  is  given  by  considerations 
of  cost.  The  emphasis  is  thus  on  obtaining  an  optimal  set  of  predictors  of  a 
particular  size  rather  than  on  obtaining  optimal  predictions  regardless  of 
battery  size.  Perhaps  because  of  the  prevalence  of  the,  former  emphasis, 
particularly  before  the  advent  of  electronic  computers,  the  problem  of  pre¬ 
dictor  selection  has  received  a  great  deal  more  attention  than  the  general 
problem  of  rank  reduction. 

Most  methods  of  predictor  selection  are  alike  in  selecting  first  the  variable 
having  the  highest  single  validity,  and  adding,  step  by  step,  the  variable 
which,  together  with  those  previously  selected,  will  give  the  greatest,  increase 
in  the  multiple  correlation  with  the  criterion.  These  so-called  accretion 
methods  differ  with  respect  to  computational  procedure  and  method  of 
deciding  how  many  predictors  to  use.  Perhaps  the  computationally  simplest 
such  method  is  the  square-root  (or  triangular-factoring)  method  described 
by  Summerfield  and  Lubin  (1951).  Horst  has  generalized  and  extended  this 
method  for  absolute  (1955)  and  differential  (1954)  prediction  of  multiple 
criteria.  Horst  and  MacEwan  (1900)  have  described  a  method  which  is 
essentially  the  reverse  of  the  accretion  method.  Here  one  eliminates  at  each 
step  the  predictor  contributing  least  to  the  multiple  correlation.  The  accre¬ 
tion  and  elimination  methods  will  not  in  general  result  in  the  same  battery, 
nor  will  either  of  them  necessarily  give  the  battery  of  given  size  having  the 
highest  obtainable  validity. 

Horst  (1941)  has  suggested  two  models  for  reduced-rank  prediction.  His 
rationale  is  based  upon  the  factor  analysis  hypothesis  that  the  predictor  matrix 
is  basic  only  because  of  the  presence  of  error  or  specific  factors.  One  of  these 
models  assumes  the  presence  of  specifics.  Accordingly,  the  matrix  of  predictor 
intercorrelations  is  augmented  by  the  vector  of  criterion  correlations  and  com¬ 
monality  estimates  arc  placed  in  the  diagonal  prior  to  factoring.  Least-squares 
weights  arc  then  computed  for  the  common  factors.  This  method  was  tested 
by  Leiman  (1951)  using  12  predictors  and  computing  weights  on  samples  of 
30  cases.  A  rank-3  solution  gave  weight-validities  which  were  significantly 
higher  than  those  obtained  with  the  full-rank  solution.  This  method  has  the 
disadvantage  of  being  difficult  to  treat  theoretically,  since  the  nature  of 
communalities  and  of  the  factor  scores  (which  are  not  unique)  arc  not  well 
understood.  The  other  model  suggested  by  Horst  accomplishes  rank  reduction 
by  attempting  to  remove  error  factors  rather  than  specific  factors.  Here  the 
best  least-squares  approximation  to  the  predictor  intcrcorrelation  matrix  is 
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used,  the  principal-axes  solution.  One  advantage  of  this  method  is  that  it  is 
theoretically  straightforward.  Another  advantage  is  that  rank  reduction  is 
accomplished  independently  of  the  criterion  and  thus  does  not  capitalize 
on  the  errors  in  the  criterion. 

Virtually  the  exact  opposite  of  this  model  has  been  implicitly  suggested 
by  Guttman  (195S).  Since  the  inverse  of  the  predictor  correlation  matrix  is 
directly  involved  in  computing  regression  weights,  one  might  well  base  pre¬ 
dictions  on  the  best  lower-rank  approximation  to  the  inverse  rather  than  on 
the  approximation  to  the  intercom* lation  matrix.  The  best  set  of  factors  for 
approximating  the  inverse  is,  as  Guttman  points  out,  the  worst  for  approxi¬ 
mating  the  intercorrelation  matrix.  In  view  of  this  paradox,  perhaps  one 
should  abandon  approximation  as  a  criterion  for  selecting  the  factors  to 
be  retained  for  prediction  and  simply  use  those  factors  giving  the  highest 
multiple  correlation,  as  is  attempted  in  the  predictor-selection  methods. 
Certainly  the  basic  assumption  of  the  rationale  for  approximating  the  inter- 
correlation  matrix  may  be  questioned:  that  the  reliable  variance  is  concen¬ 
trated  in  the  larger  prinopal-axes  factors,  the  smaller  factors  being  composed 
mainly  of  error.  For  example,  in  a  study  by  Davis  (1945)  involving  nine 
principal-axes  factors,  a  strict  correspondence  between  variance  contribu¬ 
tion  and  reliability  was  not  found;  e.g.,  the  split-half  reliability  for  the  eighth 
factor  was  larger  than  for  the  fourth  factor. 

The  present  study  proceeds  along  both  theoretical  and  empirical  lines. 
First  an  attempt  is  made  to  work  out  some  of  the  consequences  of  regression 
theory  for  reduced-rank  models.  Since,  as  noted  above,  there  is  reason  to 
question  the  appropriateness  of  regression  theory  for  psychological  predic¬ 
tion  problems,  an  empirical  comparison  of  five  reduced-rank  procedures  is 
also  carried  out.  The  methods  used  were  predictor  elimination,  predictor 
selection,  the  method  of  approximating  the  intercorrolation  matrix,  the 
method  of  approximating  the  inverse,  and  the  method  using  the  principal-axes 
factors  giving  the  highest  multiple  correlation.  As  will  be  seen,  both  the 
theoretical  and  the  empirical  evidence  favors  the  method  of  approximating 
the  intercorrelation  matrix. 


CHAPTER  2 


IMPLICATIONS  OF  REGRESSION  THEORY  FOR 
REDUCED  RANK  MODELS 

The  General  Linear  Hypothesis 

Regression  theory  was  first  worked  out  at  the  beginning  of  the  19th 
century  by  Gauss  and  Legendre  and  has  since,  of  course,  been  presented  by 
innumerable  authors  from  various  points  of  view.  Among  recent  sources,  a 
rigorous  presentation  with  geometrical  interpretations  has  been  given  by 
Schcffc  (1959).  A  simpler  presentation  entirely  in  terms  of  matrix  algebra 
is  given  by  Kcmpthornc  (1952).  Anderson  (1958)  provides  a  generalization 
to  multiple  criteria.  A  presentation  in  terms  of  deviation  scores  may  be  found 
in  Cramer  (1946).  Some  results  from  regression  theory  which  arc  relevant  to 
the  rank-reduction  problem  are  summarized  below.  The  derivations,  which 
arc  for  the  most  part  omitted,  may  be  found  in  the  sources  mentioned  above. 
Let 

y  be  a  column  vector  of  N  observations  on  the  criterion ; 
x  be  an  N  X  M  matrix  of  rank  M  <  N,  each  row  of  which  represents 
an  observation  on  each  of  M  predictors; 
e  be  an  Arth-ordcr  column  vector  of  uncorrclatcd  errors,  each  dis¬ 
tributed  normally  with  mean  zero  and  variance  a2 ; 

/9  be  an  M  X  1  vector  of  population  regression  coefficients; 

C  be  a  covariance  matrix  of  the  variable  given  in  the  subscript. 

The  general  linear  hypothesis  is  that 

(1)  y  =  .r/3  +  e. 

The  assumptions  regarding  e,  apart  from  normality,  may  be  stated  as 

(2)  m  =  o, 

(3)  C.  =  L(ee')  =  a2 1. 

From  these  equations  it  follows  that  the  criterion  has  the  expectation 

(4)  E{y)  =  .r/3, 
and  the  covariance  matrix 

(5)  Cv  =  E[(y  -  xfi){y  -  zfl)']  =  *7. 
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Let 

4  be  the  M  X  1  vector  of  least-squares  estimates  of  the  population 
regression  coefficients ; 

y  be  the  If  X  1  veetor  of  estimates  of  the  criterion  based  on  4- 

Then 

(6)  4  =  (x'xY'x'y, 
and 

(7)  y  =  xi 3. 

The  vector  4  has  the  property  of  minimizing  the  sum  of  squares  of  the  errors 
in  estimating  y  from  y.  These  errors  will  be  orthogonal  to  the  predietors  and 
also  to  the  estimates  themselves.  The  error  sum  of  squares  has  the  expectation 


(8) 

Thus 

(9) 


E[(y  -  y)'(y  -  y)]  =  {N  -  M)J. 


*2 

a 


(?/  -  ??)'(?/  -  y) 
N  -  M 


provides  an  unbiased  estimate  of  a2.  What  is  generally  termed  the  standard 
error  of  estimate  is  given  by  &.  The  variable  &2  is  distributed  independently 
of  4. 

The  estimates  of  the  regression  coefficients  have  the  expeetation 

(10)  E0)  =  0, 
and  the  covarianee  matrix 

(11)  Ct  =  E[0  -  p)0  -  /3)']  =  c\x'x)-\ 

The  estimates  of  the  criterion  have  the  same  expectation  as  the  eriterion 
itself, 

(12)  E(y)  =  E(xj3)  =  xE0)  =  .t/3, 
but  are  not  independent,  sinee  from  (7),  (11),  and  (12), 

(13)  Cs  =  E[(xj3  —  x/5)(z4  —  a^8)']  =  xCgx'  =  <r2x(x'x)~1x'. 

The  canonical  form  of  the  general  linear  hypothesis  may  be  obtained 
as  follows.  Let  x  be  expressed  as 

(14)  x  =  ub' , 

where  u  is  an  N  X  M  orthonormal  matrix  of  f aetor  scores,  and  b  is  an  if/  X  M 
matrix  of  factor  loadings.  Let  V  be  an  N  by  N  —  M  orthonormal  matrix 
sueh  that  the  N  X  N  matrix  II  in 


(15) 


H  =  [u  v  ] 


s 
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is  an  orthonormal  matrix.  The  matrices  u,  b,  and  v  arc  always  obtainable, 
and  can  be  determined  solely  from  the  predictors  without  reference  to  the 
criterion.  Then  the  A’th-order  vector  of  transformed  criterion  values 


(16) 


has  the  expectation 
(17) 


L  z2  i 


i  tt'y] 

i'ii 


l--(zd 

b'f , 

Ufc) 

0  J 

and  the  covariance  matrix 


(18) 


a~I . 


Thus  the  best  possible  predictions  for  the  A  —  M  transformed  observations 
z2  will  always  be  zero,  regardless  of  the  true  regression  coefficients  or  of  the 
particular  values  of  the  criterion.  The  least-squares  estimates  of  the  regression 
weights  are  so  chosen  as  to  reproduce  exactly  the  M  transformed  observations 


zx  from 

(19) 

2,  =  u'y  =  b'i 5, 

so  that 

(20) 

/5  =  b'~'u'y. 

Equation  (20)  may  also  be  obtained  by  putting  (14)  in  (G).  Thus,  errors  can 
occur  only  in  estimating  z2,  and  since  the  estimated  value  of  z2  is  zero,  we  have 

(21)  (y  -  y)'(y  -  fi)  =  z2'z2. 


Metric  and  the  Status  of  the  Multiple  Correlation 

In  regression  theory,  the  multiple  correlation  coefficient  and  other  func¬ 
tions  of  the  predictors  such  as  means,  standard  deviations,  and  covariances 
do  not  have  the  status  of  population  parameters.  This  is  because  the  predietors 
are  not  assumed  to  be  random  variables  but  rather  fixed  values.  Thus,  regres¬ 
sion  theory  does  not  admit  of  statistical  inferences  about  such  functions. 
However,  one  can  make  statistical  inferences  about  such  characteristics  of 
future  samples  as  depend  on  the  criterion,  provided  that  the  relevant  features 
of  the  predictor  matrix  in  the  future  samples  are  assumed  to  be  known  in 
advance.  lror  example,  one  can  assume  that  exactly  the  same  predictor  matrix 
will  be  obtained  in  future  samples  or  merely  that  the  predictor  intercorrela¬ 
tions  will  be  the  same.  Using  the  latter  assumption  and  scaling  the  criterion 
appropriately,  one  can  define  both  a  sample  and  a  population  multiple  cor¬ 
relation  coefficient. 
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Except  where  correlations  arc  concerned,  no  assumptions  about  metric 
arc  made  in  the  present  paper.  However,  it  should  be  noted  that  if  the  equa¬ 
tions  of  the  preceding  section  were  to  be  applied  to  data  in  the  original  units 
of  observation,  a  correction  for  origin  would  be  required.  This  correction  will 
be  accomplished  if  a  predictor  is  added  which  is  defined  to  be  unity  for  all 
cases.  If  this  is  done,  equation  (G)  of  the  preceding  section  may  be  shown  to 
be  identical  to  the  usual  formulas  for  raw-score  regression  weights,  which 
are  typically  expressed  in  terms  of  means  and  covariances  or  correlations  and 
standard  deviations. 

The  question  of  metric  also  arises  in  connection  with  defining  multiple 
correlation.  The  assumption  made  here  whenever  correlation  coefficients  are 
discussed  is  that  all  measures  are  normalized,  i.e.,  expressed  as  deviations 
from  the  sample  mean  in  units  of  tire  sample  standard  deviation  multiplied 
by  the  square  root  of  the  number  of  cases  in  the  sample.  We  may  now  define 
the  square  of  the  multiple  correlation  in  the  sample  as 

(22)  R2  =  =  y'x{x'x)~ljc'y 

and  in  the  population  as 

(23)  p2  =  p'x'xp. 

If  we  let  r  be  the  M  X  3/  matrix  of  predictor  intercorrelations,  (23)  may  be 
written  as 

(24)  P2  =  /3V/3, 

since,  on  the  basis  of  the  assumption  about  the  metric, 

(25)  r  =  x'x. 

Thus  p  will  be  a  population  parameter  if  it  is  assumed  that  the  predictor 
intercorrelations  will  be  the  same  in  all  samples. 

An  unbiased  estimate  for  p  may  be  obtained  as  follows.  The  expectation 
of  the  criterion  sum  of  squares  is,  from  (l), 

(26)  E(y’y)  =  A[(.r/3  +  e)'(.r/3  +  e)\  =  fi'x'xfi  +  2/3'x'A(e)  +  E(e'e). 

From  (23),  the  first  term  on  the  right  is  p2  and  from  (2)  the  second  term  is 
zero.  The  third  term  is  the  trace  of  (3).  Thus 

(27)  E(ij’y)  =  p2  +  No2. 

Since  the  errors  of  estimate  are  orthogonal  to  the  estimates,  we  have 

(28)  y’y  =  y'y  +  (y  -  y)’{y  -  y). 

From  (7)  and  (22),  the  first  term  on  the  right  is  R2.  Thus  from  (8)  and  (27), 

(29)  E(R2)  =  E(y'y)  -  E[(y  -  y)'(y  -  y)} 

=  p2  +  No2  -  (N  -  M)o2  =  p2  +  Mo2. 
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Given  the  assumed  metric,  the  criterion  sum  of  squares  will  always  be  unity, 
so  from  (27), 


(30) 


and  (29)  may  be  written  as 

(31)  E(R 2)  =  p2  +  3/(1  ~  p2)- 


From  (31)  it  is  clear  that  the  extent  to  whieli  R2  overestimates  p2  will  vary 
directly  with  the  number  of  predictors  and  inversely  with  the  sample  size. 
Solving  equation  (31)  for  p2  wc  obtain  the  following  unbiased  estimate  for  p2: 


(32) 


R2c  = 


NR 2  -  M 
N  -  M  ' 


Equation  (32)  will  be  recognized  as  the  familiar  “shrinkage”  formula  for 
multiple  R. 

It  is  perhaps  worth  noting  that  Rc,  or  “shrunken  R”  is  not  an  estimate 
of  weight-validity  or  of  the  shrinkage  to  be  expected  in  the  correlation  be¬ 
tween  the  criterion  and  its  estimate  if  weights  computed  on  one  sample  are 
applied  in  other  samples.  It  does  provide  an  estimate  of  the  correlation  that 
would  have  been  obtained  between  the  criterion  and  its  estimate  if  the  popula¬ 
tion  regression  weights  had  been  used  instead  of  their  least-squares  estimates. 
Shrunken  R  may  also  be  thought  of  as  an  estimate  of  the  multiple  R  that  could 
be  obtained  in  a  very  large  sample  having  the  same  predictor  intercorrelation 
matrix  as  the  observed  sample. 

The  Accuracy  of  Prediction  in  Future  Samples 

In  prediction  problems  we  wish  to  compute  a  set  of  weights  from  a  given 
sample  which  will  give  the  most  accurate  predictions  obtainable  when  applied 
to  other  samples.  Specifically,  we  will  assume  that  the  sum  of  squares  of  the 
errors  of  prediction  in  each  other  sample  is  the  quantity  to  be  minimized. 
If  we  let  ft  be  a  set  of  weights  obtained  in  some  fashion  from  a  previous 
sample,  this  sum  of  squares  may  be  written  (Kempthornc,  1952)  as 

(33)  ( y  -  xP)'(y  -  xj})  =  (y  -  xpy(y  -  a;j§) 

+  e'x(x'x)~1x'e  +  2 03  —  fi)'x'e  +  (/?  —  /S)Vx(/3  —  /3). 

The  expected  value  is 

(34)  E[(y  -  xi B)'(y  -  z/3)]  =  NY  +  (J3  —  j3)'x'x(j3  -  ft). 

Now  the  second  term  on  the  right  has  an  expectation  in  the  sample  from 
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which  /3  was  obtained.  Assuming  that  the  usual  least-squares  estimates  are 
employed,  we  have,  using  equation  (11), 

(35)  E[ 03  -  /3)V.r(0  -  ft]  =  tr  [E[x(j3  -  ft(/3  -  /3)V]] 

=  tr  (xC$xr)  =  a  tr  [x(x'x)~'x']. 

Using  (14),  we  may  write  the  matrix  whose  trace  we  require  as 

(3G)  x(x'x)~'x'  =  ub'(bb')~1bu'  =  ub'b'~'b"'bii'  =  uu'. 

Putting  (36)  in  (35),  we  may  write 

(37)  Z?[(3  —  /3)V.t(/3  —  j3)]  =  a  tr  (uu1)  =  a  tr  ( u’u )  =  cr2  tr  (/)  =  il/o-2. 

Now  if  we  assume  that  x'x,  or  equivalently  the  factor-loading-matrix  b,  is 
the  same  in  all  samples,  we  would  expeet  the  sum  of  squares  of  errors  of  pre¬ 
diction  to  be  (N  +  M)a.  More  generally,  if  /Sis  any  estimate  of  /3  computed 
from  the  original  sample,  we  would  expeet  the  sum  of  squares  of  errors  of 
prediction  in  future  samples,  provided  that  the  factor-loading  matrix  is  the 
same  as  in  the  original  sample,  to  be 

(38)  #  =  AV  +  E[( ft  —  0)'x'x(J3  —  0)]. 

Thus  i fa  will  be  taken  as  an  inverse  index  of  W'cight-cfficicncy :  the  smaller 
it  is,  the  more  suitable  0  will  be  for  a  prediction  problem.  In  particular, 

(39)  h  =  W  +  M)a2. 

Sinee  the  interpretation  of  (38)  is  basic  to  the  following  development,  we 
will  examine  its  derivation  with  some  care.  Certainly  <{/$  is  not  a  mathe¬ 
matical  expectation  in  the  usual  sense,  but  rather  an  expectation  of  an  expecta¬ 
tion.  Since  N,  <r2,  /3,  and  (by  assumption)  x’x  are  fixed,  the  expectation  in 
(34)  is  a  function  of  /3,  and  is  thus  fixed  as  soon  as  the  original  sample  is 
drawn.  Since  this  quantity  is  a  function  of  the  criterion  in  the  original  sample, 
its  expectation  in  this  sample  is  tpti-  The  quantity  that  we  are  directly  eon- 
eerned  with  minimizing  is  the  one  in  (34).  This  quantity  is  itself  not  deter¬ 
mined  in  advance  of  drawing  the  first  sample,  blit  its  expectation  is  deter¬ 
mined.  Rather  than  minimize  the  quantity  of  direct  interest,  then,  we  attempt 
to  minimize  its  expectation. 

An  estimate  of  weight- validity  may  be  obtained  from  (39).  Assuming 
the  metrie  of  the  previous  section,  and  using  (9)  and  (22), 


(40) 


.2  v'y  —  y’y  =  1  —  R2 
a  N  —  M  N  —  M 


Thus,  an  unbiased  estimate  for  i/q  is,  from  (39) 

(41)  h  =  fur# (1  -  R2)- 
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For  an  arbitrary  set  of  weights  /3,  the  weight-validity  is 

(42)  IE  = 

\  p  x  xfi 

The  sum  of  squares  of  errors  of  prediction  is 

(43)  S  =  (y  -  xfi'fj)  -  xp)  =  1  -  2y'x&  +  p'x'xp. 

If  (42)  is  substituted  in  (43), 

(44)  S  =  .1  -  2WV0’x'xfi  +  B’x'xB  . 

Since  p  is  the  vector  of  least-squares  weights  from  the  original  sample,  under 
the  assumption  that  x'x  is  constant,  the  radical  in  the  second  term  on  the 
right  of  (44),  and  the  third  term  on  the  right  become,  respectively,  R  and  R2 
of  the  original  sample.  Solving  (44)  for  IE  gives 

(45)  W  =  —~fR— 


Now  to  obtain  an  estimate  of  TE,  we  substitute  for  S  in  (45)  the  estimate  of 
its  expectation  given  by  (41).  Simplifying,  wc  obtain 


(40) 


,«v  Nit2  -  M 

R{N  -  M) 


To  see  the  relation  of  the  estimated  weight-validity  to  the  estimated  popula¬ 
tion  multiple  correlation  as  defined  in  the  preceding  section,  we  put,  (32)  in 
(40),  obtaining 


(47) 


IF  =  = 


R c  R< 


R 


Re 


Since  Rc  is  less  than  It  (unless  It  is  unity),  the  left-hand  factor  on  the  right 
of  (47)  will  be  less  than  one,  so  IE  will  be  less  than  Rc- 

Perhaps  a  more  important  application  of  (38)  is  its  use  as  a  criterion 
for  evaluating  reduced-rank  models  for  computing  regression  weights.  An 
alternate  approach  is  indirectly  suggested  by  Leiman  (1951,  pp.  3-4).  There, 
the  assumption  is  made  that  the  least-squares  weights  for  the  lower-rank 
system  will  give  better  predictions  than  least-squares  weights  for  the  full- 
rank  system  to  the  extent  that  they  provide  closer  approximations  to  the 
population  regression  weights  for  the  full-rank  battery.  The  reason  for 
rejecting  this  position  is  as  follows:  It  is  well  known  that  the  optimal  weights 
for  a  subset  of  predictors  may  differ  greatly  from  the  weights  of  the  same 
predictors  when  the  full  battery  is  retained.  A  mathematical  statement  of 
this  fact  b  given  in  (104).  Thus  one  cannot  properly  measure  the  suitability  of 
a  reduced-rank  set  of  weights  in  terms  of  how  closely  they  approximate  the 
full-rank  weights.  It  seems  more  likely  that  the  least-squares  weights  for 
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a  subset  of  predictors  or  of  factor  scores  may,  because  of  the  increased  number 
of  degrees  of  freedom,  be  so  much  more  stable  than  the  weights  for  the  full 
set  as  to  give  more  accurate  predictions  despite  the  loss  of  information.  In 
any  case,  the  criterion  in  (38)  involves  no  assumptions  other  than  those 
usually  made  in  applications  of  regression  theory  to  prediction  problems  and 
is,  moreover,  referred  directly  to  the  expected  errors  of  prediction. 

In  evaluating  reduced-rank  solutions,  a  question  arises  as  to  the  number 
of  factors  to  be  included  in  the  general  linear  hypothesis.  If  the  full-rank 
hypothesis  is  retained,  then  the  quantity  No*  in  (38)  is  fixed,  so  that  the  only 
way  of  improving  on  ft  will  be  to  find  a  ft  for  which  the  second  term  is  less  than 
21/ or2.  If,  however,  a  smaller  set  of,  say,  L  predictors  (cither  the  original  ones 
or  factor  scores)  is  hypothesized,  both  terms  change.  The  variance  of  the 
errors,  a2,  will  of  course  increase  in  proportion  to  the  systematic  variance 
in  the  criterion  accounted  for  by  the  discarded  predictors.  If  we  denote  this 
larger  variance  by  <r£  and  the  least-squares  weights  for  the  reduced  battery 
by  ft,  then 

(48)  =  (iV  +  L)a  i, 

as  will  be  seen  in  the  next  section.  Thus  the  ft  for  any  subset  of  L  predictors 
for  which  (N  L)al  is  less  than  (N  +  M)a2  will  be  an  improvement  over  ft. 

Another  possible  application  of  (38)  would  be  in  obtaining  a  criterion 
for  how  many  predictors  to  retain  in  the  standard  predictor-selection  pro¬ 
cedures.  If  we  denote  by  R r.  the  multiple  correlation  obtained  with  a  set  of 
L  predictors,  this  criterion  is  obtained  directly  from  (41): 

(49)  h  =  (!  -  Rd- 

One  would  retain  those  L  predictors  for  which  \j/$  is  the  smallest.  Wc  use 
rather  than  IF  since  weight-validity  is  an  indication  not  of  the  actual  errors 
of  prediction  but  of  the  errors  which  would  have  been  obtained  if  the  predic¬ 
tions  could  themselves  have  been  weighted  after  the  criterion  had  been 
observed.  In  other  words,  a  correlation  coefficient  between  two  variables  is 
independent  of  differences  in  location  and  scale,  whereas  actual  errors  of 
prediction  are  in  part  determined  by  such  differences. 

The  General  Reduced-Rank  Model 

The  reduced-rank  solution  will  first  be  developed  in  terms  of  a  general 
factor  model.  Predictor  selection  and  prediction  from  principal-axes  factors 
will  then  be  considered  as  special  cases  of  this  model.  Let 

(50)  x'x  =  bb' 
be  any  complete  factoring  of  x'x.  Then 

(51)  u  =  x(b')~‘ 
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will  bo  the  orthonormal  matrix  of  factor  scores.  The  matrices  x,  u,  and  b  are 
the  same  as  those  in  (14).  Now  we  partition  u  and  b  after  the  Lth  column  so 
that,  from  (14), 


(52) 


x  =  [m, 


=  uj)[  +  V2V2- 


We  will  assume  that  the  columns  of  u  and  b  have  been  permuted  so  that  the  L 
factor  seores  retained  for  prediction  arc  given  by  u,,  or  (if  one  prefers  to 
think  of  prediction  from  a  rank-L  approximation  to  .r)  by  u,b[.  We  will  now 
show  that  the  two  assumptions  are  equivalent  for  prediction  problems.  Note 
first,  however,  that  in  future  samples  the  weights  must  be  applied  to  the 
predictors  rather  than  to  the  factor  seores  or  to  the  lower-rank  approxima¬ 
tion.  The  latter  must  be  obtained  as  a  row  transformation  of  the  prediction 
matrix,  since  a  prediction  equation  must  be  applicable  to  individual  cases. 

Let  the  inverse  of  b  be  conformably  partitioned  and  denoted  by  B'  so  that 


(53) 

1 Vb  = 

B{  '  [6.6J  = 

B'A 

BA 

B'2J 

BA 

BA- 

0  L 

(54)  ux  =  xB , 

is  a  unique  solution  for  ux  as  a  transformation  on  the  rows  of  x.  To  see  this, 
we  let  7  be  any  other  such  transformation,  and  let 

(56)  E  =  7  -  B,. 

Then 

(56)  ii  [  =  xy  =  .r/?,  -{-  xE  —  tit  ~h  xE 
so  that 

(57)  xE  —  0, 

which,  since  *  is  basie,  implies  that  E  is  zero.  Now  let  k  be  a  set  of  least- 
squares  weights  for  iq.  Sinee  ux  is  basie,  k  is  unique.  Let  k  be  a  sot  of  least- 
squares  weights  for  utb[.  Since  is  nonbasie,  k  is  not  unique.  If 

(58)  ihb'Jt,  —  y  = 

and 

(59)  uA  —  y  = 

the  sums  of  squares  of  e6  and  of  e„  will  be  minimized  by  and  du,  respectively. 
The  former  sum  of  squares  ean  be  no  less  than  the  latter,  for  we  could  always 
take 

(60) 


k  =  b'A. 
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The  two  sums  of  squares  will  be  equal  if  we  let 

(61)  A  =  B&. 

Therefore,  a  set  of  least-squares  weights  for  (58)  will  be  given  by  A  in  (61) 
and 

(62)  {'it-.  =  erueu. 

But  sinec  A  is  unique,  b[ ft,  must  be  unique,  and  (60)  holds  for  all  least-squares 
solutions  A  of  (58).  Thus,  (58)  and  (59)  are  identical,  and  beeause  of  the 
uniqueness  of  71,  in  (54),  we  have 

(63)  i3  =  DA 

as  a  unique  set  of  least-squares  weights  for  x  under  the  assumption  of  redueed 
rank. 

If  it  is  assumed  that  the  criterion  depends  solely  on  the  subset  of  L 
faetors  retained  for  predietion,  the  general  linear  hypothesis  takes  the  form 

(64)  y  -  +  ct, 

where  x,  y,  and  eL  are  defined  in  the  first  seetion  of  this  chapter.  All  of  the 
results  of  that  seetion  may  be  obtained  for  the  present  hypothesis  if  we 
replaee  x  by  xD,  and  (3  by  A  in  (1)  through  (13).  In  like  manner,  (48)  may  be 
obtained  from  the  derivation  of  (39).  Thus,  from  (6)  and  (54)  the  least- 
squares  estimate  of  A  is  given  by 

(65)  A  =  K«i  T'<V  =  u[y. 

It  has,  from  (10),  the  expeetation 

(66)  E{  A)  =  A 
and,  from  (11),  the  covarianee  matrix 

(67)  C$u  =  <rl(w(i*i)_I  =  <YJ  ■ 

An  unbiased  estimate  of  the  veetor  of  weights  to  be  applied  direetly  to  the 
predietors  is  given  by  0  as  defined  in  (63),  sinee 

(68)  E(fi)  =  E(BA)  =  BMu)  =  BA. 

The  eovariance  matrix  for  these  weights  will  be 

(69)  O  =  E[(BA  ~  BAXBA  ~  BAY]  =  BfitJY  =  vlB^. 

The  estimates  of  the  criterion  will  now  be,  from  (7), 

(70)  yL  =  xBt  A  =  xfi- 

The  expected  sum  of  squares  for  the  errors  of  estimate  becomes,  from  (8), 

(71)  E[(y  -  yL)'(y  -  yL)]  =  (N  -  L)<r2L. 
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The  matrix  II  for  transforming  the  criterion  observations  to  canonical 
form  may  take  exactly  the  same  form  as  in  (15): 

(72)  //  =  («,  u,  v ). 

The  matrix  [w-2  c]  is  now  arbitrary  to  the  extent  that  only  v  was  arbitrary 
before.  It  will  be  convenient,  however,  to  define  II  as  in  (72).  Partitioning 
the  transformed  observations  somewhat  differently  from  the  way  it  was  done 
in  (10),  we  let 


My 

(73) 

z  = 

Z  2 

=  Ily  = 

* 

\My 

«3 

My 

The  elements  of  z2  and  23  will  all  have  expected  values  of  zero,  while  the 
expectation  of  z,  will  be 

(74)  E(zx)  =  E(u[y)  =  E0U)  =  p„. 

The  unbiased  estimate  for  <r,2  may  be  expressed  in  terms  of  z2  and  z3  as 


(75) 


a,.  = 


Z&2  +  Zfea 
N  -  L 


The  implications  of  using  a  reduced-rank  solution  instead  of  the,  con¬ 
ventional  solution  can  perhaps  be  better  understood  if  the  full-rank  hy¬ 
pothesis  of  (1)  is  retained,  rather  than  the  rank-L  hypothesis  of  (G4).  We 
first  observe  that  P  is  a  biased  estimate  of  /3,  since 

(70)  im  -■  E(BMy)  =  BMxfi  =  BMP. 

Its  covariance  matrix,  which  will  now  be  proportional  to  cr2  instead  of  to  <r 
is  given  by 

(77)  C$  =  E[{B{u[y  -  BMP)(BMv  ~  BMP)']  =  BMMee'uPjB] 
since  premultiplying  (1)  by  u[  gives 

(78)  u'y  =  bp  +  t£,'e. 

Continuing,  with  (3)  in  (77), 

(79)  Ci  =  BME(cc’)uj:i  =  o2B,B[- 
The  first  and  second  moments  about  ft  will  be 

(80)  E(fi  -  p)  =  BMP  ~  P  =  -(/  -  BM)P  =  -BMP 

and 

(si)  Blip  -  m  ~  PN 

=  C9  +  [E(p  -  -  p)Y  =  a2BtB[  +  BMPPMB i. 
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Equation  (11)  may  be  written  as 


(82)  Cfi  =  a\x'x)~l  =  aBW  =  +  a2B2B'2. 


Thus,  from  the  standpoint  of  relative  efficiency  (Mood,  1950,  p.  149)  in 
estimating  /3,  8  and  /S  may  be  compared  in  terms  of  the  diagonals  of  the 
rightmost  terms  of  (SI)  and  (S2).  If  the  trace  of  the  former  is  smaller,  on 
the  average  the  reduced-rank  estimates  will  be  more  efficient  than  the  full- 
rank  estimates. 

The  expected  value  of  z  as  given  by  (73)  will  now  be 


\  ?/,  vP 

w) 

Vg\ 

v  'x(3_ 

-  0  J 

We  recall  from  (19)  that  $  is  computed  so  that 


Zi  _  [~Wd 

_zj  L&sisJ 


But  jf  is  computed  to  reproduce  only  zv : 


(So)  s,  =  u(y  =  b'lBlu'ly  =  b(fi. 

We  have 

(86)  b’J  =  b’,B  My  =  o. 

Thus,  the  reduced-rank  solution,  in  effect,  predicts  a  value  of  zero  for  z2 
rather  than  a  value  of  b'2p.  If  the  elements  of  b2@  are  smaller  than  or2,  then 
the  prediction  of  zero  would  have  the  higher  relative  efficiency. 

The  statistic  &l  will  be  an  overestimate  of  a".  To  see  this,  first  note  that 


(87)  E(z2z2  +  Z3Z3)  -  tr  [E(z2z'2) ]  +  tr  [E(z2z'3)\ 

=  tr  (a2 1  +  b'288'b2)  +  tr  ( <r2T) 

=  (.17  -  L)a2  +  S'bMfl  +  (N  -  M)a2 
=  (K  -  L)a2  +  d’b2b'20. 

Then  from  (75), 


(88) 


E(&1)  =  <j2  + 


8'b2b'20 
N  -  L 


Next,  we  describe  the  effect  of  hypothesized  rank  on  our  inverse  index 
of  weight-efficiency,  ipe-  We  will  denote  this  index  and  its  estimate  by  and 
where  the  full  rank  717  is  assumed,  and  by  Lipp  and  L  where  the  reduced- 

rank,  L,  is  assumed.  Mathematical  expectation  under  the  hypothesis  of  full 
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rank  will  be  denoted  by  EM(  )  and  under  the  hypothesis  of  reduced-rank 

by  El{). 

The  reduced-rank  index  L\ p$  was  given  by  (48).  To  obtain  the  full-rank 

index,  we  first  evaluate  the  rightmost  term  in  (38).  Using  (81), 

(SO)  A’,, [03  -  d)V*(/3  -  ft]  =  tr  [xE(0  -  0)(fi  -  ff)  ’]x') 

=  a2  tr  (xBJitf)  +  tr  (xB2b^'b2Bix') 

=  a2  tr  (m,u{)  +  tr  (u2b'2pp'b2u'2) 

=  <r2  tr  («[«,)  +  &'b2u'2n2b’$ 

=  La-2  +  fi'b2b2{l. 

Substituting  (S9)  in  (38),  we  obtain 

(90)  Mfa  =  (AT  +  LV  +  13'bM 3. 

An  unbiased  estimate  of  L\}/p  is,  from  (75)  and  (48), 

(91)  L<Ps  =  (N  +  L)& l  =  z'2z2  +  z'2z2  +  v27y  r  (z2z2  +  z'3z3). 

Ail  unbiased  estimate  of  is,  from  (87), 

(92)  ufo  =  z'2z2  +  zfe3  +  y  p  y  z'3z3. 

The  latter  will  also  be  an  unbiased  estimate  of  L  fa,  since 


It  would  not,  however,  be  as  stable  an  estimate  as  Lfa,  since  the  rightmost 
term  of  (91)  is  based  on  more  observations  than  the  rightmost  term  of  (92). 
If  Lfa  were  used  to  estimate  ufa,  it  would  have  a  positive  bias,  since,  from 
(88)  and  (90), 

(94)  EM(Lfa)  =  (N  +  L)[a2  +  ppf)  =  .ufa  +  P'bM. 

In  practice,  it  would  often  be  convenient  to  express  these  estimates  in 
terms  of  the  multiple  correlation  coefficient.  If  the  metric  of  the  third  seetion 
is  assumed,  the  elements  of  z,  and  z2  will  be  the  correlations  between  the 
factor  scores  and  the  criterion,  or  factor  validities.  Since  the  factor  scores 
arc  uncorrelated,  the  squared  multiple  correlation  between  the  first  L  factors 
and  the  eriterion  will  be 


(95) 


Rl  =  Zl'Zl  =  1  —  z2z2  —  z&s. 
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Hence  (91)  and  (92)  arc  equivalent  to 

(90)  Lfo  =  1  -Rl  +  -7ffi  ~  %l) 

and 


(9/) 


.1/  —  l 


-Rl  + 


27,(1  -  R\,) 
N  -  ,1/ 


Equation  (96)  is,  of  course,  equivalent  to  (49).  Although  and  will 
in  general  differ  only  very  slightly,  the  former  is  to  be  preferred  in  applica¬ 
tions,  since  RL  will  be  less  inflated  by  overfit  than  will  RM. 

In  theoretical  comparisons  of  different  factor  solutions,  will  be  most 
useful,  since  it  is  a  function  of  the  loadings  of  the  discarded  factors.  The 
optimal  factor  solution  would  be  that  which  minimized  the  rightmost  term 
of  equation  (90). 


Some  Particular  Reduced  Rank  Procedures 

Of  the  five  particular  rank-reduction  procedures  considered  in  the 
present  study,  three  involve  prediction  from  principal-axes  factors,  and  two 
involve  prediction  from  a  subset  of  the  original  predictors.  Summerhcld  and 
Lubin  (1951)  have  shown  that  a  subset  of  predictors  is  equivalent  to  a  subset 
of  orthogonal  triangular  (or  square-root)  factor  scores.  The  first  factor  is 
simply  the  first  predictor.  The  second  factor  is  that  portion  of  the  second 
predictor  which  cannot  be  predicted  from  the  first.  The  third  factor  is  that 
portion  of  the  third  predictor  which  cannot  be  predicted  from  the  first  and 
second.  The  remaining  factors  arc  similarly  obtained.  Each  factor  will  thus 
be  independent  of  the  earlier  factors  and  of  the  predictors  corresponding  to 
them,  and  will  therefore  have  zero  loadings  on  those  predictors.  Accordingly, 
the  factor-loading  matrix  will  be  a  lower  triangular  matrix,  i.c.,  its  supra- 
diagonal  elements  will  all  be  zero. 

The  predictor-selection  and  predictor-elimination  methods  may  be 
thought  of  as  procedures  for  placing  the  predictors  in  the  approximate  order 
of  their  contribution  to  the  multiple  correlation  with  the  criterion.  Since  the 
triangular  factors  arc  determined  by  the  ordering  of  the  predictors,  the  first  L 
factors  will  tend  to  give  the  highest  multiple  correlation  obtainable  with  a 
subset  of  L  predictors. 

Prediction  from  the  principal-axes  factors  giving  the  highest  validity  is 
similar  to  these  methods  in  that  the  subset  of  factors  to  be  retained  is  entirely 
determined  by  the  characteristics  of  the  sample  from  which  regression 
weights  arc  to  be  computed.  Under  these  circumstances,  none  of  the  indices 
of  validity  or  weight-validity  is  directly  applicable,  since  all  are  based  on  the 
assumption  that,  for  given  L,  the  subset  of  predictors  to  be  retained  is  deter¬ 
mined  in  advance  of  observing  the  criterion.  A  detailed  analysis  of  the  con- 


20 


REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION 


sequences  of  choosing  factors  on  the  basis  of  tlic  observed  y  will  not  be 
attempted.  Clearly,  however,  the  fewer  the  degrees  of  freedom  available,  the 
larger  will  be  the  variance  of  the  sample  validities,  and  the  smaller  the. 
probability  that  the  subset  of  L  faetors  having  tlic  largest  true  validity  will 
give  the  largest  sample  validity.  Moreover,  the  true  validity  for  the  subset 
chosen  would  tend  to  fall  short  of  the  true  validity  for  the  optimal  subset, 
and  the  sample  validity  for  the  chosen  subset  would  tend  to  overestimate  its 
true  validity,  in  inverse  proportion  to  the  degrees  of  freedom.  Still,  it  seems 
that  subsets  of  predictors  selected  in  this  way  would  usually  have  higher  true 
validities  than  would  arbitrarily  chosen  predictors. 

Although  the  foregoing  discussion  is  not  concrete  enough  to  lead  to 
precise  conclusions,  it  does  suggest  the  desirability  of  having  a  method  of 
factoring  that  would  provide  an  a  priori  expectation  as  to  the  contributions 
to  validity  of  the  individual  faetors.  The  success  of  using  approximation  to 
the  intcreorrelation  matrix  or  to  its  inverse  as  a  criterion  for  selecting  pre¬ 
dictors  will  in  part  be  determined  by  the  extent  to  which  contribution  to  the 
approximation  is  related  to  contribution  to  validity. 

In  describing  the  two  particular  factor  methods  in  terms  of  the  general 
model  of  the  preceding  section,  we  will  consider  first  the  triangular  factors. 
For  the  general  factor-loading  matrix,  b,  we  substitute  a  lower  triangular 
factor-loading  matrix,  t.  But  where  b  was  partitioned  only  after  the  Lth 
column,  we  will  partition  t  also  after  the  Lth  row,  so  that 


(98) 


t  =  [i. 


Lt„ 


0 

L  2- 


We  will  partition  the  inverse  of  t  similarly,  and  denote  it  by  T' .  It  may  be 
readily  verified  that 


(99) 


rpr  __ 

! 

^-1 

til 

0  " 

1  — 

“ 

-TL 

_  t 

1 22  *21  III 

r.i_ 

It  will  also  be  convenient  to  partition  the  predictor  matrix  x  after  the  Lth 
column,  and  to  partition  the  regression  vectors  /3  and  /3  after  the  Lth  element. 
Wo  first  note,  from  (52),  that 


(100)  x  =  [ x ,  x2 ]  =  u,t!  -j-  u2t'2  =  [«,<{ ,  Utli2]  +  [0  u2t22]. 


Tims 


(101)  rqtf  • —  [.r,  Uit{2] 

and 


(102)  x2  —  u i  / , 2  — j-  u2ti2 . 

The  first  term  on  the  right  of  (102)  is  that  portion  of  x2  which  can  be  predicted 
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from  x,,  while  the  second  term  is  that  portion  of  x2  which  is  independent  of  xv. 
Thus  the  “reduced-rank  approximation”  of  x  on  which  predictions  are  based 
is  from  (101)  composed  simply  of  the  retained  predictors  augmented  by  the 
portion  of  the  discarded  predictors  that  is  determined  by  those  retained. 
From  (63)  and  (65),  the  estimated  regression  weights  will  be 

A“ 

02- 


Their  expected  values,  under  the  full-rank  hypothesis,  will  be,  from  (76) 


(101)  E(fi)  =  Til’i8  = 

K.  t'J01 

_  ns,  +  (t'nyitii02i . 

•  1 

~E(0i) 

_  0  - 

102- 

L  o 

-Em. 

The  value  for  E(0,)  in  (104)  may  be  thought  of  as  an  expression  for  the 
optimal  weights  for  a  subset  of  predictors  in  terms  of  the  optimal  weights 
for  the  entire  set.  The  original  weights  for  the  retained  predictors  are  altered 
as  a  function  of  the  original  weights  for  the  discarded  predictors.  This  illus¬ 
trates  the  point  made  in  the  section  on  accuracy  of  predictions,  to  the  effect 
that  weights  for  a  subset  of  predictors  cannot  be  properly  evaluated  in  terms 
of  how  closely  they  approximate  the  weights  for  the  entire  set.  The  covariance 
matrix  of  the  sample  regression  weights,  obtained  from  (79),  is 

(l'n')lu  0| 

.  0  oJ 

The  expected  values  of  the  transformed  criterion  observations  will  be, 


from  (83), 

>*.)] 

i 

i 

tllftl  +  ^22$  2 

(106) 

m  = : 

E(z2) 

_ 

t’S 

= 

^22^2 

mz3): 

L  o 

0 

From  (90),  the  inverse  index  of  weight  efficiency  u'l'i  is  given  by 

(107)  M'Pl  =  (Ar  +  L)<r'!  +  0 =  (iV  4-  L)<?~  +  j3(<22^22|32. 

To  obtain  the  principal-axes  solution,  we  first  express  the  predictor 
matrix  x  in  terms  of  its  basic  structure  (Horst,  1961,  ch.  17): 

(108)  x  =  PAQ’. 

Now,  in  place  of  the  general  factor-seorc  matrix  u  we  have  the  principal-axes 
factor-score  matrix  P.  The  principal-axes  faetor-loading  matrix,  corresponding 
to  the  general  b  is  given  by  Q  A,  where  Q  is  a  square  orthonormal  and  A  a 
diagonal  matrix.  Equation  (50)  now  takes  the  form 

(109)  x'x  =  QA2Q'. 


(105) 


Ci  =  =  <j‘ 


(103) 


8  =  T1u[y 


{t[i)  'u'iV 
0 
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The  eigenvalues  and  eigenvectors  of  x'x  will  be  given  by  the  elements  of  A2 
and  the  columns  of  Q  respectively.  We  may  partition  the  factors  on  the  right 
of  (108)  to  obtain 


(110) 


[/’. 

p>] 

A,  0 

■ Qf 

_o  a2j 

QL 

[Pi 

P2] 

\qC 
L  AM 

=  PAxQl  +  PtA&i. 


As  before,  both  the  factor-score  and  factor-loading  matrices  arc  considered 
to  be  partitioned  after  the  7.th  column.  For  the  inverse  of  the  factor-loading 
matrix,  B',  we  will  now  have 


(111) 


Kt  A,  QaA2] 


a  r‘o; 

-A  VQi 


The  sample  regression  vector  is,  from  (03)  and  (65), 

(112)  B  =  QAllP[y. 


Under  the  full-rank  hypothesis,  the  lower-rank  sample  regression  weights  will 
have  the  covariance  matrix,  from  (79), 

(113)  Cf  =  a^QtA^Ql. 


From  (83),  the  canonical  form  of  the  criterion  will  have  the  expectation 

(114)  E(z)  = 


XQW 

E{z2) 

= 

A  JQ1P 

E{z3)_ 

0 

Equation  (90)  will  now  take  the  form 
(115)  a,4/0  =  (X  +  /V  + 

The  specific  reduced-rank  prediction  models  may  be  obtained  from  the 
foregoing  development  by  assuming  appropriate  permutations  either  of  the 
predictors,  in  the  case  of  triangular  factors,  or  of  the  columns  of  P  and  Q, 
and  of  the  elements  of  A,  in  the  ease  of  principal-axes  factors.  We  note  from 
(73)  and  (83)  that  each  element  of  z,  and  z2  is  determined  by  only  one  factor: 
the  observed  value  by  the  factor  scores,  the  expected  value  by  the  factor 
loadings.  In  predictor  selection,  each  time  a  predictor  is  selected,  a  factor, 
and  hence  an  element  of  z ,,  is  determined.  At  each  step  in  the  procedure, 
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that  predictor  is  selected  which  will  make  the  next  element  of  2,  as  large 
(in  absolute  value)  as  possible.  In  predictor  elimination,  a  factor  and  hence 
an  element  of  z2,  is  determined  eaeh  time  a  predictor  is  eliminated.  At  each 
step,  that  predictor  is  eliminated  which  will  make  the  next  element  of  z2  as 
small  (in  absolute  value)  as  possible. 

In  the  method  of  predicting  from  the  factors  giving  the  best  least-squares 
approximation  to  the  predictor  intercorrelation  matrix,  the  elements  of  A  arc 
placed  in  order  from  largest  to  smallest,  so  that  the  largest  arc  in  A,  and  the 
smallest  in  A2.  If  the  inverse  is  to  be  approximated,  the  elements  of  A  arc 
placed  in  the  opposite  order,  i.e.,  from  smallest  to  largest.  (When  we  speak 
of  ordering  the  elements  of  A,  wo  assume,  of  course,  that  the  columns  of  P 
and  Q  are  permuted  correspondingly.)  In  the  method  of  predicting  from  the 
principal-axes  factors  giving  the  highest  validity,  the  factors  are  permuted 
so  as  to  place  the  elements  of  z,  and  z2  in  order  of  absolute  value  from  largest 
to  smallest,  with  the  largest  values  in  z,,  the  smallest  in  z.,. 

The  Problem  oj  Finding  an  Optimal  Reduced-Rank  Solution 

There  arc  three  major  problems  involved  in  obtaining  an  optimal  reduced- 
rank  solution.  The  first  concerns  the  method  of  rank  reduction:  whether 
subsets  of  the  original  predictors,  of  the  principal-axes  factors,  or  of  factors 
obtained  by  some  other  method  will  give  the  most  accurate  prediction  in 
future  samples.  The  second  problem  is,  having  obtained  the  factors,  to 
specify  the  subset  of  a  given  size  that  may  be  expected  to  provide  the  greatest 
accuracy  of  prediction.  The  third  problem  is,  having  specified  the  subset 
which  would  be  used  for  any  given  rank,  to  determine  the  particular  rank 
that  will  tend  to  lead  to  the  most  accurate  predictions. 

The  estimate  of  the  inverse  index  of  weight-efficiency  given  in  (91)  and 
(90)  provides  a  solution  (or  a  potential  solution)  to  the  third  problem.  It 
docs  not,  however,  enhance  our  ability  to  deal  with  the  second  problem,  since, 
as  can  be  seen  from  (90),  it  merely  indicates  the  traditional  approach;  namely, 
to  attempt  to  select  that  subset  of  predictors  of  given  size  having  the  highest 
multiple  correlation  with  the  criterion.  The  drawbacks  of  such  an  approach 
when  degrees  of  freedom  arc  limited  wore  discussed  in  the  preceding  section. 
Since  a  reduced-rank  solution  is  indicated  only  when  degrees  of  freedom 
are  limited,  a  selection  method  that  is  independent  of  the  criterion  might 
well  be  preferable.  Some  evidence  favoring  this  view  is  provided  in  the 
empirical  portion  of  the  present  study.  In  the  present  section  we  assume  that 
view  to  be  correct  and  accordingly  consider  only  methods  of  selection  which 
arc  independent  of  the  criterion. 

If  the  present  analysis  is  correct,  an  optimal  solution  will  be  one  which 
minimizes  u'l't  as  given  in  (90).  In  the  absence  of  observations  on  the  criterion, 
nothing  can  be  said  about  /3  or  a2,  so  our  only  course  is  to  seek  a  value  for 
&2  which  will  minimize  fi%b'2p  for  general  /3.  The  quantity  to  be  minimized 
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may  also  be  expressed  as  the  sum  of  squares  of  the  expected  values  of  the 
Zi,  as  given  in  (83) : 

(116)  fi'bj,#  =  [E(z2)Y[E(z2)]. 

Minimizing  this  quantity  will  be  equivalent  to  making  the  elements  of  E(z2) 
as  small  (in  absolute  value)  as  possible.  We  let  the  ith  element  of 


be  denoted  by  z{.  If  we  knew  these  values,  the  second  of  the  problems  stated 
above  would  be  solved  by  discarding  those  factors  for  which  z;  was  smallest. 
Denoting  the  column  of  factor  loadings  for  the  ith  factor  by  we  have, 
from  (83), 

(118)  2,-  =  b'.S- 

Let  D  be  a  diagonal  matrix  whose  ith  element  is  given  by 

(119)  Di  =  WA, 

Let 

(120)  IF  =  6ZT\ 

Denoting  the  ith  column  of  IF  by  IF.,-,  we  have 

(121)  W’.tW.t  =  =  1. 

The  expected  values  of  st  and  z2  can  now  be  expressed  in  terms  of  D  and  TF  as 

(122)  z  ~  b'0  =  DIF'S, 
or 

(123)  Zi  =  D<W:<0. 

Since  we  have  assumed  that  nothing  is  known  about  0,  and  since  (121)  holds 
for  all  i,  we  can  have  no  a  priori  expectation  as  to  the  magnitude  of  iF',/3. 
Thus  our  only  basis  for  predicting  the  rank  order  of  the  z,-  in  the  absence  of 
criterion  observations  will  be  the  magnitudes  of  the  D;.  A  tentative  solution 
for  the  problem  of  which  factors  to  retain  for  prediction,  then,  will  be  to  dis¬ 
card  those  faetors  having  the  smallest  values  of  D,.  From  (1 19),  we  see  that  D < 
is  the  sum  of  squares  of  the  loadings  for  the  ith  factor,  or  the  variance  ac¬ 
counted  for  by  that  factor.  Thus,  for  a  rank-L  solution,  we  wish  to  retain  those 
L  factors  giving  the  best  least-squares  approximation  to  the  predictor  matrix. 

It  is  well  knowm  that  the  principal-axes  factors  will  give  a  better  least- 
squares  approximation  to  the  predictor  matrix  than  will  factors  obtained 
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by  any  other  method.  Thus,  as  a  tentative  answer  to  the  first  of  the  above 
problems  we  obtain  the  principal-axes  solution. 

Now,  given  the  restriction  that  the  factors  be  selected  independently  of 
the  criterion,  we  can  state  that  the  best  prediction  possible  with  a  rcduccd- 
rank  solution  will  be  obtained  from  the  principal-axes  factors  giving  the 
best  least-squares  approximation  to  the  correlation  matrix.  We  note  that,  for  a 
principal-axes  solution,  D  and  TF  become  the  A  and  Q  of  the  preceding  section. 
Thus  wc  can  also  state  that  the  method  of  approximating  the  inverse  will  give 
the  worst  possible  predictions,  since  with  that  method  one  discards  the  factors 
corresponding  to  the  largest  elements  of  A. 

We  have  shown  that,  with  appropriate  assumptions,  the  principal-axes 
factors  making  the  largest  contribution  to  the  variance  of  the  predictors  (or 
simply,  the  largest  principal-axes  factors)  are  optimal  with  respect  to  our 
index  of  expected  accuracy  of  prediction.  It  may  be  shown  that  the  factors 
arc  also  optimal  with  respect  to  the  variance  of  the  sample  regression  xveights. 
The  sum  of  these  variances  will  be  smaller  than  for  any  other  method  of  rank 
reduction.  From  (G9)  (or  (79)),  this  sum  will  be  proportional  to  the  trace  of 
BiB[.  We  let 

(124)  g'  =  Bu'  =  B,u[  -f  B2u’2, 
so  that 

(125)  g'  —  B2  n'2  =  Bxu[. 

It  is  well  known  that 

(12G)  tr  (ulB[Bl «{)  =  tr  (B,B[) 

will  be  a  minimum  when  II,  is  composed  of  the  largest  principal-axes  factors  of 

(127)  g'g  =  BB'  =  ( x'x )  1  =  QA2Q’. 

Equivalently,  the  above  traec  will  be  a  maximum  when  b ,  is  composed  of  the 
largest  principal-axes  factors  of  x'x. 

The  major  conclusion  of  this  section  is  that,  in  the  absence  of  criterion 
observations,  the  best  index  to  use  for  selection  of  predictors  or  factors  will 
be  the  amount  of  variance  accounted  for  in  the  predictor  data  matrix.  In  the 
case  where  a  subset  of  the  original  predictors  is  to  be  used,  one  would  eliminate 
those  predictors  for  which  the  trace  of  t22l'22  in  (107)  is  a  minimum.  Where 
a  factor  solution  is  feasible,  the  largest  principal-axes  factors  would  be  re¬ 
tained.  The  important  question  of  how  many  degrees  of  freedom  must  be 
available  before  the  criterion  observations  can  be  used  to  advantage  in  the 
selection  process  has  been  left  open.  Thus  a  sound  basis  for  deciding  whether 
to  use  the  methods  above  or  to  use  methods  xvhich  attempt  to  maximize  the 
sample  multiple  correlation  with  the  criterion  is  still  lacking. 


CHAPTER  3 


AX  EMPIRICAL  COMPARISON  OF  FIVE 
REDUCED  RANK  PROCEDURES 

The  Data 

A  typical  application  of  regression  methods  is  to  the  problem  of  predicting 
academic  success  as  measured  by  eollege  grades.  The  data  for  the  present 
comparisons  were  taken  from  a  reeent  study  of  academic  prediction  by 
Shankcr  (1961).  Twenty-nine  predictor  variables  and  five  separate  criterion 
variables  are  used.  Fifteen  of  the  predictors  are  those  composing  the  Uni¬ 
versity  of  Washington  Entrance  Battery.  These  have  been  in  use  for  predicting 
college  grades  since  1953,  and  include  age,  sex,  test  scores,  and  high-school 
grades.  The  remaining  predictors  are  taken  from  the  Edwards  Personal 
Preference  Schedule  (EPPS).  The  15  variables  of  the  EPPS  are  ipsativc;  i.c., 
any  one  can  be  computed  exactly  from  the  remaining  14.  Accordingly,  only 
14  are  used  here,  since  the  15th  would  be  completely  redundant  for  purposes 
of  prediction.  The  EPPS  variables  arc  described  by  Edwards  (1954).  Descrip¬ 
tions  of  the  Entrance  Battery  variables  are  given  by  Shankcr  (1901).  Since 
the  specific  nature  of  the  predictors  is  not  of  immediate  interest  in  the  present 
study,  we  simply  list  them  here. 


Edwards  Personal  Preference  Schedule  Variables 


1.  Achievement 

8. 

Suocorancc 

2.  Deference 

9. 

Dominance 

3.  Order 

10. 

Abasement 

4.  Exhibition 

11. 

Nurturanee 

5.  Autonomy 

12. 

Change 

0.  Affiliation 

13. 

Endurance 

7.  Intraception 

14. 

Heterosexuality 

High-Sehool  Grade-Point  Averages 

15.  English 

18. 

Social  Science 

16.  Mathematics 

19. 

Natural  Science 

17.  Foreign  Language 

20. 

Electives 

Test  Seores 

21.  Vocabulary 

25. 

Mathematics 

22.  Mechanical  Knowledge 

20. 

Social  Science 

23.  English  Usage 

27. 

Quantitative  Reasoning 

24.  English  Spelling 
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Other  Variables 

28.  Age 

29.  Sex  (coded  0  for  male,  1  for  female) 

The  criterion  variables  consist  of  grade-point  averages  in  various  college 
course  areas.  The  five  criteria  chosen  for  the  present  study  were  those  having 
500  or  more  cases  available,  as  listed  below. 

1.  All-University,  973  cases  4.  Chemistry,  526  eases 

2.  Mathematics,  541  cases  5.  Psychology,  507  cases 

3.  English  Composition,  804  cases 

The  cases  used  were  973  students  who  entered  the  University  of 
Washington  as  freshmen  between  1953  and  1958.  Only  those  students  were 
included  for  whom  measurements  on  all  predictors  and  at  least  one  criterion 
variable  were  available.  Scores  on  the  criterion  variables  and  on  the  Entrance 
Battery  (predictors  1 5-29)  were  obtained  from  the  files  of  the  University  of 
Washington  Division  of  Counseling  and  Testing  Services.  The  EPPS  data 
(predictors  1-14)  were  obtained  partly  from  Edwards,  partly  from  Wright 
(1957),  and  largely  from  the  Division  of  Counseling  and  Testing  Services  files. 


Method 

The  five  reduced-rank  prediction  methods  chosen  for  comparison  were 
the  following. 

1.  The  predictor-elimination  method  (Horst  and  MacEwan,  1900) 

2.  Predictor  selection  by  the  accretion  method  (Horst,  1955) 

3.  The  method  of  largest  principal-axes  factors  (Horst,  1941) 

4.  The  method  of  smallest  principal-axes  factors  (Guttman,  1958) 

5.  The  method  using  the  principal-axes  factors  giving  the  highest 
multiple  correlation. 

As  noted  in  the  introduction,  we  can  be  virtually  certain  that,  for  suffi¬ 
ciently  small  samples,  one  or  more  of  these  methods  will  give  more  accurate 
predictions  than  will  the  standard  full-rank  method.  And  as  shown  in  the 
last  section  of  Chapter  2,  there  is  reason  to  believe  that  method  3  will  be 
superior  to  the  others  for  samples  below  some  critical  size.  Similarly,  method 
4  would  be  expected  to  give  the  poorest  predictions.  We  would  expect  also 
that  the  statistics  as  given  by  (91)  and  if'  as  given  by  (46)  would  give 
some  indication  of  the  accuracy  of  prediction  in  future  samples  obtainable 
from  a  particular  set  of  weights. 

The  method  used  for  the  empirical  comparisons  consisted  essentially  of 
replications  of  the  following  procedure.  All  cases  with  measurements  available 
on  a  particular  criterion  were  taken  as  the  statistical  population.  From  this 
population  a  random  sample  was  drawn.  Regression  weights  were  computed 
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for  each  reduced-rank  method  for  each  rank  from  1  to  29.  Thus  29  sets  of 
weights  for  each  method  were  computed.  The  sets  of  weights  for  rank  29 
were,  of  course,  the  same  (aside  from  rounding  error)  for  all  methods.  From 
the  cases  remaining  in  the  population  after  the  original  sample  was  removed, 
a  new  random  sample  was  drawn.  Each  set  of  weights  computed  in  the 
original  sample  was  then  applied  to  the  new  sample,  and  measures  of  accuracy 
of  prediction  were  computed.  For  all  computations,  predictor  and  criterion 
variables  were  normalized  as  described  in  the  second  section  of  Chapter  2. 
In  effect,  then,  means  and  sums  of  squares  were  equated  for  all  variables 
on  all  samples.  Differences  in  these  values,  therefore,  do  not  show  up  in  the 
total  squared  errors  of  prediction. 

For  each  of  the  five  criterion  variables,  this  design,  using  all  five  reduced- 
rank  methods,  was  replicated  for  six  different  original-sample  sizes:  255,  210, 
105,  120,  75,  and  30  cases.  The  new  samples  consisted  of  252  cases  for  all 
replications.  Weight-validities  were  used  as  measures  of  accuracy  of  prediction. 

An  additional  set  of  replications  was  carried  out  for  criterion  1  (All- 
University)  only,  and  omitting  method  4.  Here  the  estimates  of  weight- 
validity  and  of  total  squared  errors  of  prediction  were  also  computed  from 
the  original  samples.  A  wider  range  of  original-sample  sizes  was  used :  the  six 
sizes  above  and  also  sizes  of  435,  390,  345,  and  300  cases.  A  second  new 
sample  was  randomly  drawn  for  each  replication  from  the  eases  remaining 
in  the  population  after  the  original  sample  and  the  first  new  sample  were 
removed.  Both  new  samples  again  consisted  of  252  cases  for  all  replications. 
As  measures  of  accuracy  of  prediction  when  the  original  sample  weights  were 
applied  to  each  of  the  two  new  samples,  total  squared  errors  of  prediction 
as  well  as  weight-validities  were  computed. 

All  phases  of  the  above  procedures  were  carried  out  on  the  IBM  709 
computer,  using  programs  written  especially  for  this  study.  The  method  of 
drawing  the  samples  was  as  follows.  The  cases  in  a  particular  criterion  popula¬ 
tion  of,  say,  NT1  students  were  assigned  sequential  numbers  from  1  to  NT. 
A  sequence  of  random  numbers  was  generated  using  a  procedure  described 
in  the  WDPC  Users  Manual  (Western  Data  Processing  Center,  1901,  see. 
9.2.4).  The  original  sample  of  size  N0  consisted  of  the  cases  corresponding  to 
the  first  N0  distinct  numbers  modulo  NT  from  the  sequence  of  random 
numbers.  The  remaining  A"?1  —  N0  cases  were  renumbered  sequentially  from 
1  to  NT  —  No-  The  new  sample  of  size  Ar,  consisted  of  the  first  Nt  distinct 
numbers  modulo  NT  —  N0  from  a  second  sequence  of  random  numbers.  In  a 
similar  way,  all  other  samples  were  obtained,  using  a  new  sequence  of  random 
numbers  for  each  sample. 

After  obtaining  the  original  sample,  the  matrix  of  predictor  intercorre¬ 
lations  and  the  vector  of  the  correlations  between  the  predictors  and  the 
criterion  were  computed.  Retaining  the  notation  of  the  preceding  chapter 
and  recalling  that  the  variables  in  x  and  y  were  normalized,  the  predictor 
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intorcorrclation  matrix  was  computed  by  (25)  and  the  vector  of  predictor- 
criterion  correlations  by 

(12S)  rc  =  x’y. 

Next  the  predictor  elimination  and  predictor  selection  procedures  were  carried 
out  and  the  corresponding  regression  weights  computed,  using  the  procedures 
described  by  Horst  and  MacEwan  (1960)  and  by  Horst  (1955),  respectively. 
The  matrix  r  was  then  factored  as  in  (109).  The  regression  weights  for  the 
three  principal-axes  methods  were  computed  as  follows.  We  let  zL  denote  the 
Lth  clement  of  z ,,  Q,L  denote  the  Lth  column  of  Q,  and  AL  the  Lth  element 
of  A,. 

First  the  vector  of  factor  validities  Zi  was  computed  from 

(129)  *!  =  A7‘Qfr„. 

Equation  (129)  is  equivalent  to  (73),  since,  from  (108),  (110),  and  (128),' 

(130)  A  TlQlrc  =  A  T'Q’x'y  =  Ar‘QKQ,A,P{  +  Q2A2P’2)y  =  Ply. 

The  regression  vector  for  rank  L  was  computed  by 

(131)  =  <?, Ar1*,  =  E  Q.ttf*, 

»«*  1 

which,  it  may  be  noted,  is  equivalent  to  (112).  Thus  the  regression  vector 
for  rank  L  +  1  was  obtained  from  the  vector  for  rank  L  by 

(132)  1  =  &L  "b  Q  .L  +  iAl-hZl+1  . 

The  weights  for  methods  3,  4,  and  5  were  all  computed  in  the  same  way,  the 
only  difference  being  in  the  order  of  summation. 

The  new  sample  was  drawn  and  the  various  correlations  computed  as 
for  the  original  sample.  The  weight-validity  and  total  squared  errors  of 
prediction  obtained  with  a  particular  vector  of  weights  were  computed 
respectively  by 

(133)  TF  = 
and 

(134)  *  =  1  -  2r'JL  +  ftr/3 1. 

Equations  (133)  and  (134)  are,  of  course,  equivalent  to  (42)  and  (43).  Note 
that  r  and  rc  in  (133)  and  (134)  are  computed  on  the  new  sample  while  pL 
was  computed  on  the  original  sample. 

Results  and  Discussion, 

The  weight-validities  obtained  with  methods  1,  2,  3,  and  5  on  all  five 
criteria  are  given  in  Table  1.  The  six  pages  of  Table  1  correspond  to  the 


r'r0L 

VPlrfL 


Weight-Validities  for  Four  Methods  and  Five  Criteria 


30 


REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION' 


c 

X 

x 

C/1 

CO 

IP 

x 

fV* 

00 

; — ) 

— 

CO 

3 

X 

-P 

T-l 

3 

— 

X 

CT 

TO 

X 

ff. 

X 

V 

CO 

y 

X 

y 

/ 

lO 

1- 

rP 

l>- 

1- 

I- 

sC 

p 

1  - 

1- 

1  -» 

1  - 

1- 

1  - 

”P 

X 

”P 

"p 

TP 

• 

O’ 

TP 

*p 

”T 

-P 

— 

X 

X 

X 

o 

X 

PI 

l-T 

PI 

SO 

— J 

_ 

CO 

iO 

X 

0 

T'l 

_ 

PI 

X 

IO 

m 

3 

X 

x 

x 

CO 

/ 

or 

CO 

CO/ 

CO 

o 

c 

CO 

X 

r/i 

X 

/ 

/ 

-'/) 

X 

x 

X 

TP 

TP 

X 

x 

-r 

X 

T 

— 

LO 

-P 

Tp 

LO 

LO 

Tp 

TP 

TT 

TP 

X 

X 

X 

X 

X 

X 

© 

3 

, _ 

-p 

— 

l'- 

— 

CO 

ro 

>o 

CO 

_ 

»n 

X 

-*• 

(“5 

3 

X/ 

X 

1^ 

I'- 

© 

1^. 

i'- 

PI 

r 

CO 

/ 

/ 

ro 

CO 

or; 

or; 

1  - 

1^. 

1  >• 

1  - 

I'- 

i- 

1  - 

i  •— 

>o 

LO 

*o 

T 

T 

~p 

-p 

”P 

~P 

Tp 

T- 

"■ 

" 

TP 

TP 

TP 

”P 

T 

X 

X 

X 

X 

X 

X 

X 

G5 

I'- 

x 

r*r 

PI 

X 

x 

"M 

i- 

LO  CO 

LO 

sc 

_4 

3 

X 

X 

1- 

1  ^ 

3 

l_ 

1- 

t- 

3 

© 

x 

I  - 

X 

s 

iO 

1*0 

=r* 

i  - 

CO 

1^ 

1  - 

C/J 

CO 

TP 

1". 

1  - 

r- 

-P 

-r 

x 

i-- 

X 

X 

X 

1- 

X 

1- 

PI 

3 

X 

3 

© 

ro 

!>• 

o 

o 

»o 

PI 

3 

X 

IO 

X 

X 

1- 

PI 

© 

3 

i  - 

so 

1 

© 

-r 

-P 

”1' 

TP 

-p 

PP 

*T 

TT 

Tp 

“P 

X 

X 

X 

'O 

CO 

TO 

_ 

o 

LO 

1  ~ 

o 

m 

l~ 

PI 

lO 

rvv 

1- 

CO 

-/> 

PI 

© 

CO 

X 

CO 

1  - 

1- 

1  - 

C/j 

o 

1- 

3 

-P 

CO 

CO 

TP 

CO 

CO 

CP 

cp 

CO 

CO 

CO 

CO 

co  co 

co  co 

Tp 

-P 

X 

X 

X 

X 

X 

X 

X 

x 

3 

© 

Ol 

PI 

;-} 

X, 

rr\ 

(— , 

00 

CO 

X 

X 

X  X 

3 

_ 

© 

o 

PI 

>-o 

N 

iO 

P| 

CO 

o 

- 

X 

TP 

X 

TP 

~P 

X 

CO 

CO 

”P 

— 

Tp 

-t 

“p 

Tp 

T* 

*P 

TP 

Tp 

X 

X 

X 

X 

X 

X 

X 

— 1 

01 

CO 

00 

CO 

Cl 

Cl 

Cl 

»o 

PI 

Cl 

l>- 

o 

iO 

~P 

CO 

X 

PI 

J-) 

1  _ 

X 

X 

3 

X, 

© 

3 

X 

I- 

CO  CO 

/ 

pA 

© 

© 

© 

PI 

3 

x 

CO 

-I' 

CO 

Tp 

TP 

“P 

“P 

X 

3 

_ 

CO 

«o 

CO 

ro 

CO 

CO 

LP 

3 

X 

I''* 

3 

© 

© 

co 

3 

© 

° 

cc 

sc 

o 

sc 

SC 

sc 

sc 

3 

SC 

3 

3 

3 

3 

3 

3 

3 

3 

3 

o 

3 

G 

o 

CO 

o 

*p 

CO 

LO 

«o 

CO 

PI 

—4 

. 

PI 

ro 

X 

3 

lO 

3 

X 

3 

3 

— 

12 

— — 

o 

© 

© 

LO 

o 

3 

w 

© 

w 

o 

o 

sc 

sc 

sc 

sc 

SC 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

© 

3 

3 

© 

>o 

o 

ro 

h- 

X 

1^ 

pi 

— 

1  — 

sc 

1- 

X  LO 

_ 

PI 

PI 

PI 

X 

-M 

© 

3 

1  ^ 

hi. 

(M 

X 

1  - 

x 

-r 

i.O 

PI 

ri 

-I 

X 

G 

3 

o 

© 

© 

so 

CO 

© 

o 

SC 

SC 

SC 

3 

3 

3 

3 

3 

3 

3 

3 

o 

3 

3 

3 

— 

PI 

lO  I-. 

CO 

>o 

_ 

rv 

_ 

_ 

CO 

IT 

-r 

so 

1^ 

m 

iO 

X 

o* 

X 

_ 

3 

X 

SO 

*n 

o 

3 

I'- 

1^ 

3 

x 

so 

— 

— “ 

r T' 

co 

©  1- 

o 

3 

w 

w 

w 

w 

3 

3 

w 

3 

3 

I— 

h- 

e 

CO 

X 

© 

1- 

Cl 

i.O 

LO 

3 

>0 

PI 

3 

PI 

PI 

hr 

y 

or; 

(J, 

or; 

X 

X  X 

X  X 

CO 

XXX 

© 

o 

PI 

X 

m 

3 

PI 

PI 

— 

© 

3 

3 

CO 

— 

X 

"p 

-p 

*p 

Tp 

X 

Tp 

*P 

Tp 

Tp 

TP 

Tp 

Tp 

~~ 

TP 

*P 

~r 

T 

f 

X 

X 

X 

CO 

co 

CO 

CO 

CO 

Q 

PI 

LO 

X 

_M 

PI 

o 

rvs 

CO 

r*. 

CO 

LO 

PI 

m 

cn 

rr 

-*4 

rv-, 

PI 

X 

X 

X 

3 

X 

CM 

X 

PI 

Cl 

c/j 

1  - 

i- 

i  - 

-/ 

X 

X  X 

co 

co 

-p 

”p 

TP 

Tp 

CP 

CO 

CP 

CO 

co 

CP 

CO 

CO 

co 

co  co 

co 

X 

X 

X 

CO 

CO 

3 

CO 

CO 

3 

3 

>o 

X 

>o  PI 

X 

PI 

00 

»p 

PI 

_■ 

o- 

pi 

TP 

X 

_ 

X 

•o 

X 

X 

3 

T 

1- 

PI 

PI 

1  - 

1" 

CC 

I- 

1- 

3 

i- 

i  - 

X 

co; 

X 

XXX 

CO 

X  X 

X 

X 

co 

X 

X 

X 

CO  co 

co 

CP 

CO 

ro 

CP 

CO 

co 

co 

" 

“ 

CO 

CO 

” 

X 

3  3 

LO 

»o  o 

-p  PI 

1— 

CO 

LO 

-p 

-H  1.0 

© 

PI 

»o 

3 

1  - 

r 

3 

o 

© 

s 

© 

©  © 

i-O 

LO 

*o 

LO 

»0  LO 

lO 

LO 

LO 

iO 

LO 

LO 

LO 

IO 

LO 

IO  iO 

LO  LO 

LO 

•o 

X 

LO 

3 

•o 

3 

3 

> 

X 

00  3 

X 

I- 

LO 

3 

1^ 

^1 

p- 

sc 

-p 

pr 

-p 

—4 

iO 

1C 

PI 

X 

X 

3 

I'- 

1- 

CO 

© 

© 

1- 

1  - 

1- 

1- 

I'- 

1- 

Tp 

1- 

1-  LO 

iO 

X 

PI 

© 

8 

© 

tp 

lO 

LO 

3 

>o 

LO  LO 

LO 

•  O  LO 

LO 

»o 

LO 

LO 

lO 

lO 

iO 

lO 

LO 

lO 

LO 

>o 

3 

»o 

L/C 

IO 

3 

>0 

X 

o 

Cl 

X 

PI 

CO' 

-« 

w 

CO 

r  -  co 

CC' 

o 

3 

o 

I'- 

— 

X 

3 

X 

>' 

CO  CO 

PI 

PI 

PI 

c/_ 

ro 

Cl 

o 

3 

3  © 

< 

TP 

Tp 

«o 

LO 

LO 

LO 

X 

Tp 

TP 

Tp 

T 

Tp 

TP 

-p 

”P 

LO 

TP 

LO 

X 

*T 

X 

X 

X 

*T* 

3 

3 

— H 

CO 

C1 

x 

PI 

CO 

CO 

SC 

^“5 

SC 

O' 

iO 

PI 

X 

co 

3 

© 

X 

X 

I'- 

X 

X 

3 

X 

© 

3 

r- 

1  ~ 

DC 

PI 

PI 

PI 

y 

/- 

X 

X 

X  X 

© 

© 

3 

x 

X 

LO 

LO 

LO 

LO 

TP 

Tp 

Tp 

Tp 

Tp 

Tp 

TP 

Tp  Tp 

Tp 

Tp 

TP 

Tp 

X 

X 

X 

X 

X 

X 

3 

3 

so 

SO 

ci 

2— < 

x 

01 

CO 

Tp 

LO 

© 

I'- 

v: 

Cl 

PI 

CP 

LO 

3 

1-  X 

3 

© 

PI 

X 

x 

lO 

3 

I" 

X 

3 

e; 

1  £ 

PI 

PI 

PI 

- 1 

PI 

PI 

PI 

‘u 

CJ 

o 

u 

sjjuvu 

Decimal  point  preceding  eaeh  entry  has  been  omitted. 


TABLE  1  (Cont.) 

Weight- Validities  for  Four  Methods  and  Five  Criteria 
(Ah,  =  210) 


GEORGE R  BURKET 


31 


-c  co 

03 

oi 


g  00 
5 

O  Ol 


o 

O 

WJ  Ol 


CO 

a 

<5  Ol 


CO 

oi 


X  ° 

o  £ 


C  lO  -f  N  : 


O  O  W  CO  Cl 
©  X  X  X  C3 
XXXXX 


CO  w  CO  ©  © 

tO>  t-O  X  X  X 


CO  ‘O  CO  - 
to  *r  —  : 
co  x  x  ■ 


CO  -H  Ol  CO  CO 
o  o  >o  CO  Ol 
X  X  X  X  X 


>o  w  ci  o 
t  ^  -r  o 
X  X  X  X  X 

X  t-O  CO  C3  I- 


*o  *o  Ol  — < 

X  X  -T  X 
X  X  X  X 


c  ci  |0  -i  o 

t-O  3  0  tO  *-0 


CO  I'*  w  00  >“  CO  N  N  Nl 

_M  — <  — .  W  Ol  —  —  —  —  . 

1.0  tO  tO  LO  1C  L.0  to  t-O  1.0  « 


Cl  H  H  CO  O 
CO  I-  CO1  o  CO 

-**  -r  -r  »o  to 

Ol  x  00  00  CO 
CO  »o  00  Ol  Ol 
-f  -T  -  1C  lO 


I-  I-  66  CO  'CO  do  X 


*o  CO  *0  Ol  I- 
Ol  CO  Ol  Ol  Ol 
i.O  »-0  »0  tO  o 


i-o  io  «-0  i-o  To  io  it. 


X  O  ©  O 
Ol  Ol  01  CO  CO 
tO  O  t-O  tO  io 

©  I-—  CO  CO  CO 
io  »*6  i.o 


©  ©  O  — I  CO 
X  X  Ol  Ol  — 
X  X  t-O  LO  1-0 

o  *o  C  Ol  X 

CO  X  X  X  Ol 

to  to  1.0  1-0  »o 

Cl  X  O  iO  IO 

X  01  Ol  Ol  Ol 

IO  to  *o  to  io 


CO  — <  ©  I-  X 

to  to  lO  o  «o 

O  I"  x  — < 

X  X  Ol  01  Ol 

to  to  to  1-0  to 

01  ©  ©  X  I' 

Ol  —  — 

to  to  ■“ 


i.O  1-0  to 


co  ©  ©  x  x  n  o.  s  co  .  . .  _ 
o  O  O.  C-  O  N  x  C  - 1  o  —  —  — 

IO  to  IO  to  O  to  io  C  O  C  ©  ©  ©  ©  © 


x  x  i-  co  -h 
f  o  o  o.  o 
to  to  to  to  o 

CCCh- 
OoOOI- 
X  to  LO  ©  © 

QOCN1 
C.OOOl- 
-  LO  to  c  c 


©  i'-  «o  x  co 

XXXXX 


oi  oi  i'-  ~  © 

O  C  O  -  Cl 
©  ©  ©  ©  © 

O  X  to  O  X 
Ol  01  X  X  Ol 
©  ©  ©  O  O 

©  x  co  oi  © 

Ol  Ol  Ol  Ol  ~ 

0  0-000 


•  X  X  O  co¬ 
ol  X  X  Ol 
o  o  o  o 


X  X  X  to  X 
—  CM  01  Ol  — 


M  O  X  X  X 
Ol  Ol  Ol  Ol  Ol 

o  o  o  o  o 

Ol  O  X  X  X 
'1  Ol  Ol  01  CM 


I  to  C  C  to 


3  O  tio  to  X  Ol  Ol  X 

:  -r  •— -  ©  ©  x  x  i-  o 

"  *t  *f  x  coxxxx 


O  Ol  o  x  X 
o  x  -o  x  Ol 
X  X  x  x  -0 


X  *0  -o  X  3 
-t  — «  ©  X  x 

X  X  CO  CO  CO 

^  X  40  01 
— >  i-  i-  ©  © 

x  co  co  co  co 

h  o  O  Cl  to 

—  r-  »-  ©  © 

x  x  x  x  x 


Ol  W  Ol  H  o 
X  ©  — ■  ©  03 
co  co  x  x  co 

X  'O  Ol  ©  o 

o  i  -  x  i- 

X  X  X  X  X 

N  -  Cl  O  O 

I—  x  x  i—  © 

X  X  X  X  X 


Ol  to  to  to  X 
C  O.  c.  C  © 
co  cc  x  co  x 

—  1-0  Ol  X  — 1 

©  to  to  to  to 

X  X  X  X  X 

r-  x  x  *o  x 

t-0  to  to  to  »o 

X  X  X  X  X 


x  ©  x  —  © 

hNMOC 

X  — I  r-r  -H  X 

NCOO  O 
X  X  X  to  to 


X  Ol  lO  X  X 

o  c.  c  -  c 

to  X  1-0  to  to 

I-  O  X  I-  GO 
C3  o  X  X  X 
X  to  X  X  X 

O  X  C3  — <  X 
©  ©  X  ©  I 
X  X  X  X  • 


©  ©  ©  x  X  x  x  x  x 


O  O  to  X  I 
Of - - 


O  C3  -h  to  ©  ©  — 1 


<  —  -H  to  — I 

:  C3  Cl  X  X 

1  X  X  X  X 

X  X  Ol  X  X 
X  X  1-1-  I  — 
X  X  X  X  X 

x  »o  Ol  X  x 

—  CNN  I  — 
X  X  X  X  X 


'  X  X  to  XXX 


O  1-0  — H  X  Cl 
X  X  X  X  X 
X  X  X  X  X 

©  tO  —  ©  X 
XXX1-1- 
X  X  X  X  X 


'to  cor-xoo  ci  cc- 


83Jnt!>I 


1-0  10  to 
OXI- 

IO  Io  to 

©  I  -  I-  © 

to  to  to  lO 


Ol  Ol  I-  X  CO 

o  ©  ©  ©  © 

to  X  o  ©  o 
Ol  —  Ol  Ol  Ol 
o  c  o  c  o 

*0X000 
01  —  01  01  Ol 
o  o  o  c  c 


COXOr-Cb-  O  Cl  o  -  cc 
to  X  X  X  X  xxxxx 
X  X  X  CO  CO  xxxxx 


1-  I  -  I  —  CC  X 

l-  10  to  to  © 
xxxxx 

O  X  X  Ol  o 

to  X  X  10  X 

xxxxx 

X  1.0  C3  ©  X 

1-0  1.0  X  X  x 

X  X  CO  X  X 


ooc 

obo 


©  C3  X 
O  O  1-0 
XXX 

t-O  ©  Ol 
X  CO  X 
XXX 

XXX 

XXX 


xxxxx 

xxxxx 

xxxxx 

I-  x  O  —  O 

C3  C3  X  X  X 


coo 

I-  1-  l- 

X  X  — 1 


O  O  -X 
XXX 
X  X  X 


G$G3 


480  340  019 


TABLE  1  (Cont.) 

Weight-' Validities  for  Four  Methods  and  Five  Criteria 
(Aro  =  1C5) 


32 


REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION 


© 

00 

1—1 

o 

»o 

CO 

1^ 

Ol 

CD 

Ol 

01 

CO 

Ol 

Ol 

rr. 

ro. 

t.O 

Li 

-p 

CO 

cn 

© 

© 

co 

Ol 

Ol 

r— 

Ol 

Ol 

a 

c 

co 

Li 

Tp 

eo 

co 

Tp 

tp 

Tp 

tP 

rt< 

*-H 

— 

“ 

rr 

-p 

T 

-p 

co' 

co 

-p 

~ 

-p 

00 

! 

J-J 

CO 

CO 

C: 

CO 

Cl 

GO 

CD 

1— 

CC 

( 

o 

GO 

© 

f- 5 

o 

o 

Ol 

— 

o 

o 

o 

Ci 

1- 

Tp 

CO 

1C 

lO 

1.0 

LO 

LO 

LO 

lO 

Li 

id 

Li 

Li 

Li 

to 

Li 

Li 

Li 

Tp 

Tp 

-p 

-p 

*p 

-p 

Tp 

Tp 

OD* 

-p 

Ol 

co 

CO 

CO 

IO 

O 

CO 

LO 

in 

iO 

CO 

o 

,_4 

1^ 

in 

00 

o 

o 

CO' 

/ 

1- 

OC 

CD 

CD 

CD 

co 

Ol 

co 

Ol 

© 

© 

© 

co 

T 

lO 

Tp 

Tp 

”P 

o1 

•r* 

" 

“ 

-r 

O’ 

-p 

■~r 

T 

T 

“ 

•r 

”T 

*p 

co 

co 

CO 

© 

CO 

r>- 

CD 

CD 

ro 

cr> 

cn 

Ol 

Ol 

CM 

Li 

CO 

-H 

o 

to 

C3 

h- 

j— , 

1- 

© 

00 

Ol 

f— I 

o 

1  - 

O 

i  * 

I'- 

CD 

LO 

t.O 

Ol 

Ol 

c 

O 

O 

c 

© 

— 

co 

-r 

tp 

— 

T 

-r 

«*p 

~p 

Tp 

tP 

TP 

tP 

TP 

TP 

TP 

TP 

CD 

CD 

o 

o 

CO 

Cl 

o 

1- 

Ol 

o 

t'- 

LO 

CO 

CO 

CO 

© 

© 

CO 

© 

CO 

© 

1^ 

■/ 

CO 

cc 

r- 

V) 

!  - 

r-- 

C C 

CO 

Of) 

00 

c/> 

GO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

L'i 

CO 

co 

CO 

co 

CO 

CO 

CO 

C0 

CO 

CO 

co 

co 

co 

CO 

CO 

o 

>— 1 

Ol 

© 

1-0 

CD 

lO 

LO 

_ 

rV» 

l0 

CO 

o 

CO 

Ol 

— p 

g 

'T 

CO 

Ol 

Ol 

Ol 

01 

oi 

Ol 

Ol 

01 

01 

Ol 

Ol 

Ol 

Ol 

— 

-p 

m 

© 

© 

a 

”T* 

T 

"T 

Tr 

”T* 

"T 

T 

■TT 

rr 

T0 

-r 

— p 

*0* 

■O’ 

-o 

TP 

TP 

TP 

-p 

tP 

Tp 

-p 

TP 

CO 

CO 

cm 

CO 

© 

o 

co 

© 

C'-' 

Ol 

|_ 

o 

CC 

r- 

rv< 

—ft 

-H 

o 

— 

-p 

to 

Li 

o 

CO 

f-. 

1- 

r~- 

Li 

1- 

~l 

Ol 

cr. 

o 

o 

o 

Ci 

o 

Ci 

CO 

00 

CO 

TP 

TP 

TP 

— 

— 

-p 

*0 

-t* 

co 

■O’ 

-f 

-p 

CO 

co 

CO 

CO 

CO 

CO 

CO 

co 

CO 

CO 

I  — 

c 

CD 

o 

o 

I'- 

Ol 

CC 

cc 

— 

— H 

»o 

Ol 

Ol 

m 

10 

fs. 

h- 

00 

CO 

l>- 

a 

c 

o 

o 

o 

Ci 

GO 

oo 

CT) 

cn 

© 

Tp 

*0 

-r 

CO 

CO 

CO 

CO 

CO 

CO 

co 

© 

lo 

LO 

o 

r- 

Tp 

n 

o 

CO 

lO 

o 

CO 

to 

CO 

CO 

CO 

© 

DC 

c 

CD 

C5 

Ci 

00 

CO 

CO 

CO 

GO 

/ 

Of) 

i.O 

Li 

1-0 

CO 

LO 

i-0 

LO 

© 

o 

CD 

CD 

CD 

o 

CD 

‘b 

Li 

Li 

LO 

LO 

Li 

to 

Li 

Li 

to 

Li 

Li 

CO 

© 

00 

o 

co 

CD 

© 

LO 

cc 

O 

01 

CO 

-P 

o 

01 

01 

CO 

Of) 

CO 

r-- 

r-- 

r^- 

Ol 

■ 

-P 

CO 

Tp 

r-- 

© 

© 

c 

O 

CT 

o 

o 

o 

o 

o 

r. 

C5 

© 

© 

CO 

u 

Li 

Li 

Li 

lO 

Li 

Li 

ib 

lO 

io 

Li 

no 

CD 

o 

io 

CD 

CD 

o 

to 

Li 

to 

LO 

Li 

L0 

Li 

to 

to 

Ol 

CO 

ffH 

CO 

LO 

1- 

Ol 

.01 

1^ 

o 

01 

l- 

*— 4 

CD 

Li 

_ 

Ol 

^4 

—4 

Ol 

co 

to 

Li 

o 

© 

o 

CO 

o 

y 

00 

CO 

i  - 

1- 

CO 

ai 

66 

/ 

CO 

o 

Li 

1.0 

IQ 

CD 

lO 

lO 

CD 

Li 

L0 

Li 

Li 

Li 

Li 

Li 

LO 

Li 

Li 

Li 

Li 

L0 

Li 

Li 

Li 

to 

to 

r— 1 

CO 

© 

LO 

CO 

OC 

O 

© 

CO 

CD 

-M 

o 

-P 

o 

o 

CO 

CD 

-p 

-p 

U- 

CO 

r- 

© 

01 

co 

Li 

Ol 

© 

© 

oo 

CO 

1- 

CD 

C/J 

C/: 

Li 

L0 

CO 

LO 

CD 

CD 

1- 

I'- 

1- 

r- 

oo 

CO 

a; 

GO 

© 

Li 

Li 

L0 

Li 

Li 

lO 

Li 

Li 

Li 

Li 

io 

no 

LO 

to 

to 

Li 

Li 

Li 

Li 

Li 

Li 

Li 

Li 

Li 

Li 

Li 

F- 

F- 

to 

co 

co 

Ol 

lO 

CD 

Ol 

Ol 

i- 

CD 

CO 

GO 

00 

© 

o 

© 

co 

o 

o 

o 

1- 

&> 

ch 

r- 

oo 

r-. 

CO 

OC 

CO 

GO 

CO 

Of; 

I'* 

t'- 

r- 

t- 

c/ 

CO 

CO 

co 

co 

CM 

Ol 

co 

Ol 

Ol 

01 

Ol 

Ol 

01 

Ol 

Ol 

Ol 

Ol 

Ol 

Ol 

Ol 

01 

01 

Ol 

Ol 

Ol 

Ol 

CM 

CO 

co 

co 

© 

r- 

0 

1- 

CD 

LO 

CD 

Li 

CO 

-* 

1- 

Ol 

Ol 

CO 

© 

C-. 

© 

Sb 

00 

o 

© 

o 

CD 

CD 

Li 

to 

to 

Li 

c 

© 

© 

CO 

co 

CO 

co 

CO 

co 

co 

co 

CO 

co 

CO 

CO 

co 

co 

co 

CO 

CC' 

CO 

CO 

CO 

co 

co 

CO 

co 

CO 

CO 

01 

1—4 

Ol 

—4 

-44 

CO 

© 

r- 

© 

CO 

1^- 

C: 

-ft 

— 

CO 

-f 

o 

CD 

I-- 

1-^ 

1.0 

o 

© 

I'— 

00 

Cl 

Ol 

~P 

CO 

•p 

Ol 

01 

lO 

0. 

o 

c 

.0. 

C/J 

00 

a 

1^ 

CO 

CO 

CO 

CO 

co 

co 

CO 

CO 

Ol 

oi 

01 

01 

Cl 

01 

Ol 

Ol 

Ol 

Cl 

CM 

Ol 

© 

to 

CO 

r- 

cc 

CM 

^v-1 

LO 

—4 

r— 

-b 

h- 

Ol 

— 

CD 

Pi 

—4 

Li 

,-•) 

oo 

© 

CO 

© 

OO 

to 

Li 

CO 

lO 

LO 

Ol 

x 

CD 

/ 

CO 

CO 

CD 

1- 

CO 

J 

CO 

I'- 

1- 

(X) 

1  — 

1- 

u- 

1  - 

© 

Ol 

CO 

CO 

CO 

01 

oi 

Ol 

M 

oi 

Ol 

Ol 

Ol 

Ol 

Ol 

Ol 

Ol 

O! 

Ol 

Ol 

Ol 

CM 

Ol 

CM 

CM 

© 

Li 

o 

Ol 

l'- 

© 

r^- 

CO 

01 

Ol 

01 

o 

CO 

r>- 

r- 

CD 

CO 

CO 

Ol 

Li 

lO 

LO 

r- 

CD 

Li 

Li 

t.O 

Li 

| 

Li 

Li 

o 

»-0 

LO 

LO 

lO 

LO 

IO 

LO 

Li 

IO 

Li 

Li 

Li 

Li 

LO 

Li 

LO 

LO 

LO 

Li 

Li 

to 

to 

Li 

Li 

Li 

> 

Ol 

I"- 

00 

00 

CO 

co 

r-- 

o 

01 

O 

m 

Ol 

CD 

o 

oo 

h- 

Li 

CD 

CD 

u- 

o 

© 

© 

00 

D 

CO 

-0 

1- 

1  - 

Of) 

CO 

CO 

o 

Ol 

Ol 

o 

o 

o 

o 

00 

1- 

i- 

1- 

1- 

GO 

1- 

1- 

Li 

1  • i 

»o 

Li 

»o 

to 

Li 

Li 

Li 

CD 

CD 

CD 

CD 

CD 

Li 

CD 

CD 

CD 

Li 

Li 

Li 

to 

Li 

Li 

Li 

Li 

Li 

Li 

Li 

Li 

t 

t"- 

© 

Li 

rn 

Ol 

,—1 

-v 

CD 

LO 

-V* 

oo 

Ol 

CO 

CD 

1.0 

C5 

h- 

© 

u- 

I'- 

© 

Cl 

* 

© 

ir: 

Li 

Li 

Li 

Li 

to 

lO 

LO 

»C 

L0 

LO 

L0 

LO 

LO 

Li 

»o 

Li 

Li 

Li 

Li 

Li 

L0 

Li 

Li 

Li 

Li 

Li 

Li 

LO 

LO 

Li 

Li 

© 

lO 

CO 

Ol 

CO 

CO 

CO 

»o 

CD 

CO 

OC 

CD 

O 

Ol 

00 

-44 

— u 

~«4 

CO 

Ol 

t- 

© 

© 

1-^ 

I'- 

© 

CD 

—4 

CO 

o 

i  . 

Li 

lO 

l" 

Li 

CD 

CD 

CD 

CD 

CD 

Li 

Li 

>o 

Li 

Li 

Li 

GO 

1 

i_o 

LC 

Li 

LO 

LO 

i_0 

LO 

io 

LO 

LO 

Li 

LO 

1C 

LO 

Li 

LO 

LO 

Li 

to 

Li 

«o 

Li 

lO 

lO 

Li 

to 

Li 

Li 

Li 

t- 

© 

'C 

m 

c 

i-H 

CM 

CO 

lO 

t- 

cc 

Ol 

co 

LO 

00 

o 

Ol 

CO 

-p 

© 

I'- 

00 

© 

0 

-- 

Ol 

Ol 

Ol 

Ol 

01 

CM 

Ol 

Ol 

CM 

Ol 

D 

o 

s:}iUT?>I 

TABLE  1  (Cont.) 

Weight- Validities  for  Four  Methods  and  Five  Criteria 
(Vo  =  120) 


GEORGE  R.  BURKET 


o  ci  — i  ©  ©  ©  ci  co  co  ©  ©  co  -r  ©  -r  ©  ©  ©  ©  ©  ci  ©  ©  co  o  c  ^ 

-t<  ©  ©  i-  i-  '  oi  r-<  O  — 1  c.  co  w  m  m  ci  co  ~f  ©  ©  ©  ©  -t  -r  *t  ©  ©  -r 

n<  rt<  -rji  co  co  cc  co  co  CO  CO  CO  CO  CO  cococococo  cococo 

O  N  -I  00  O  CO  CO  C  C.  00  COOOOw  CO  -t*  ©  CO  CO  R - -  ^  rn  q  O  ©  r. 


CJ-iQWCi  O  C 
1  ©  — <  O  C5  CO  N  O  I'  I 

I  TT  rp  COCOCOCOCO  CO  C 

)  R-  CCC  tNii  C5  » 

'CIO  CO  N  ©  C  O  lOI 

1  ^  CO  COCOCOCOCO  CO  c 


lO  't1  ©  N  ’f  ©  ©  -*  i-l  ©  ©  ©  ©  ©  H  C5  o  C  H  O  -H  ©  -tR  >-(  rR  -R 

Cl  ^  ^  O  O  o  O  O  00  N  CO  ©  ©  ©  ©  ©  ©  ©  ©  ©  O  O  >-H  1-H  1— (  Cl  H  1H 

-R  CO  TjH  rti  Tfi  CO  CO  CO  CO  tR  tR  CO  CO  CO  -tR  CO  -R  -3<  -3<  -R  -R  *T  tR 


>OCO  00  H  CO  D1  00  ©  ^  I 
CM  (M  CM  — <  i  CM  Cl  Cl  Cl 

-R  -R  rR  tR  rR  ©  ’ 

©  ©  CM  -R  00  CO  CO  i 
O  Cl  N  N  C5  CO  ’-t  I 
co  CO  co  CO  co  ^  rr<  • 


Q  O  O  H  CO 
©  ©  ©  co  o 
CO  tR  tR  rR 


O  M  lO  N  ©  N*COCOC2  ©  ©  CM  CM  ©  R-  ©  »-h 

i".  r~  r>.  oo  oo  co  ©  co  co  co  t  -r  —  -r  -  cc  :-i 

COCOCOCOCO  CO  CO  rf.  -R  -R  Tji  ti<  r}i  tR  -r  tR 


CO  CO  Cl  Cl  CO  CO  «0  CO  h-  ^  C5  N  CO  IO  ©  ©  -R  © 


CO  >C  CO  h-  ©  R»  CO  ©  ©  ©  -R  O  — i  -R 
'  CO  CO  CO  CO  0101010101  Ol  Ol  CO  CO  01 
T"  O  ^  *7  -R  tR  '^Trl-T-f’-T1 


li-iNCOO  ©  -t  Ol  COO  O  N  W  1<  O  O  -h  Ol  ' 

O  O  C  O  Ol  CO  CO  CO  Ol  CO  CO  CO  Ol  Ol  Ol  —  - 

CO  CO  CO  rr  -<R  -*R  -?<  TR  -R  1"?  T  H  7  rR  -R<  -R  ' 


©  GO  CO  GO 


O  Cl  CC  CO  -7 
Ol  01  O  CO  o 
t©  i-O  ‘O  LCO  © 


CO  -11  R-  ©  CO 
©ON©  tO 

•o  ©  ©  ©  © 


R"  co  ’-t  ro  oi 

©  ©  i^  © 

tO  to  to  tO  to 


CO  ©  N  O  O 

»o  to  to  o  u: 
to  to  to  to  to 


C  C  O  ©  N  NNO 

to  to  to  to  to  to  to  to 


©  00  to  O  CO 
Ol  Ol  Ol  I"  -R 
to  to  to  to  to 

O  CO  h  CO  O 
tOCCOCOO 

to  to  O  to  o 

I o  O  O  -1 

CD  I—  I'-  00  01 

Tji  to  to  to  CO 


CO  to  to  Ol  © 
ID  CO  CD  R'-  CO 

to  to  to  to  to 

H  ol  -*  C  © 
H-OOCO 
CD  O  CD  to  tO 

— •  -R  — 1  — 1  Ol 
—  ©  CD  ©  © 

©  to  to  to  to 


N  O  Ol  h  h 

00  00  CO  CO  00 

to  to  to  to  »o 

CD  CO  "R  co  »o 
CO  00  CO  I-  I- 

to  to  to  to  to 

D  C  c:  N  IO 
00  1-  1-  I-  I  - 

to  to  to  to  to 


to  CD  CD  to  to 

Ci  CO  N  N  1- 

to  to  to  to  to 

R-  to  CO  -R  CO 
R'*  I'-  I  —  R-  I  ■» 
tO  to  to  to  tO 

C  O  ©  N  C! 
N  N  ©  CO  © 

to  to  to  to  to 


i  IO  to  to  to  to  to 


)  to  to  to  to  to  to 

)  N  CD  C  ©  N  I>  N 

)  ©  ©  N  CD  CD  ©  CD 

)  tO  IO  to  tO  to  'O  to 


00  ©  ©  ©  © 
00  CO  to  CD  l  ^ 
cm  ©I  ol  oi  <n 


©  CO  CD  to  R- 
CD  CO  tO  CD  1>- 
Cl  Cl  Cl  Cl  Cl 


Ol  CD  Cl  to  R- 
01  Ol  Ol  Ol  Cl 


CC  CO  I'  -f  oo  coco© 
tO  tO  *o  10  *o  to  *0  to 
Cl  CM  Cl  Cl  Cl  01  Cl  Cl 


eo  r-  to  to  to 
CO  CO  OC  00  to 
CO  CO  CO  CO  CO 

»o  to  ©  Cl  CO 

i-  ©  —  co  -r 

Cl  CO  CO  CO  CO 

to  to  CO  CO  © 
1-  co  ©  Cl 
Cl  CO  Cl  CO  CO 


(s.  ^  rr  fR  r\) 

—  ci  ^  ©  cs 

CO  CO  CO  CO  Cl 

to  to  CO  N  © 
Cl  rs  -H  o  C 

CO  CO  CO  CO'  CO 


-h  CM  00  00  GO 

r»  co  I  -  -r 

co  co  co  co  co 

—  __i  r-i  in  ci 
R-  1^  ©  R-  I- 
Cl  Cl  Cl  Cl  CM 

h  h  N  CO  h 
©  i- 

CM  Cl  Cl  Cl  Cl 


©  O  H  ci  D 
©  ©  ©  -r  -t 
co  co  co  co  co 

©  ©  w  —i  »<**. 

N  O  O  ©  O 
Cl  CM  Cl  Cl  Cl 

to  H  — <  ®  to 
©  ©  CD  CD  © 
Cl  CM  Cl  CM  Cl 


>  R>  00  O  t  -  R~ 

5  ©  Cl  01  ©  to 

3  CO  CO  CO  Cl  CM 

!NO  ©  00  © 

/  ©©  ©  to  to 

l  Cl  Cl  Cl  Cl  Cl 

I  -r  -r  O  CO  ©  © 

1  ©  ©  ©  to  to  to 

I  Cl  Cl  Cl  Cl  Cl  Cl 


R—  R-  CM  © 

to  ©  CO  00  © 

©  ©  ©  tR  tR 

R-  -Ji  to  ©  to 

to  ©  to  ©  !"• 

tO  ©  ©  ©  © 

r-  co  -r  — i  co 

©OHC'l  D 
CO  ~R  ©  ©  ”R 

R-  GO  "R  ©  O 
©  rR  — i  © 
CO  "R  ©  ©  ~R 


■HfCCdCOD  I''-  CO  -R  ©  ©  CM  CO  CO  CO  CM  Cl©© 
•—©©©©  CO  ©  ©  ©  00  CO  I'-  I-  I-  [--  l-  I-  I- 

©  ©  ©  ©  rR  tR  tR  *?R  -R  -R  rR  -R  ^R  -R  tR  -R  -R 

©  H  co  Cl  N  "t  M  •?  ci  N  R-  ©  ©  ©  ©  CO  t-  CO 

©  R-  R>.  R-  N  CO  CO  Cl  Cl  —  ©  ©  ©  CO  I>-  I'-  1^ 

©to©©©  ©  ©  ©  ©  ©  ©  *r  t  ~r ’t1  -r  i  — * 

N  CO  Cl  O  CO  C0©©©*^  Cl  ©  r7  CO  O  —  Cl 

ci  ci  ci  ©  —  — ’  ©  ©  ©  ©  co  i-  i-  t-  i~  i-  i--  r  - 

©  ©  ©  ©  ©  ©  ©  ©  -R  -R 


©oo©^—!  ©co-^tooo  ©yoci©c 

©©©!>.  1>-  1-  CO  ©  ©  CO  1-  I--  I  -  I-  I 

I'  H  Tji  -f  • 


rH  ©  ©  © 
I'-  ©  1-  R- 
-f  *T< 


s^utjy; 


704  070  788  029  092 

688  516  097  558  589 


TABLE  1  (Cont.) 

Weight-Validities  for  Four  Methods  and  Five  Criteria 


34 


REDUCED  RANK  MODELS  I'OR  MULTIPLE  PREDICTION 


CO  CC  CC  >C  CC  ION  c  T  CO  -P  CO  C7  O  M  ‘C  N  O  -  i-O  iO  O  N  r-  00  Cl 

co  rc  ci  rc  co  c  o  rc  ci  -  co  co  co  co  i-  rc  — •  — <  c  — ,  ci  ci  — <  -h  -4  — <  -h 

cc  'O  lo  io  *r  -r-r-p-r-r  cc  co  co  cc  cc  cc  rc  cc  cc  co  co  co  rc  cc  cc  co  co  co 


m  ::  «o  >c  ■ 


i-r  -P  -t  co  rc  cc  co  cc  cc 


i  —  O  to  CO  o  •->  cc 


;  I—  CC  O  O  CO  Cl  Cl  iC  CC  O  ' 


CC  CC  ”T  Cl  CC  Cl  — P 


Cl  Cl  Cl  Cl 


CMC  —  CI-hOO  OCOIO  NCI  CO  Cl  M  C  Cl 


:  Cl  Cl  —  Cl  : 


Cl  CC  Cl  Cl  — 


ice  rc  cc  ci  cc  cc  cccccccccc  ccccccccro  ccccccco  t-.  o 


l  —  Cl  O  Cl  COQ  -  O  — '  C-  C  C  CC  CO  C  Cl  C  H  C  M  Cl  O  -  >0  -P  O 

C.  h  Cl  C  -  -I  CO  «  O  0  1-000  l-«  O  H  o  o  ococo  ooo 

•  rc  cc  ci  ci  ci  ci  —  ci  ci  ^  —  ci  ^  ci  *-<  ci  ■—  ci  ci  ci  ci  ci 


:  NNX  CO  Ci 


;  —  —  rc  Cl  1 


”?  T  to  >0  -r  C  7  c  ' 


or — "  co  o 
I-  CO  C  -  O 
-r  cc  rc  ci  ci 


cc  r-  ci  i-  ci 

iQ  -p  <C  Cl  Cl 
Cl  Cl  Cl  Cl  Cl 

M  O  «5  io  ci 

Cl  Cl  Cl  Cl  Cl 


I--  CO  CO  —  o 
rc  >o  o  «o 

Cl  Cl  Cl  Cl  Cl 
M  N  -  CO  C 
Cl  Cl  Cl  Cl  Cl 


’toe  Cl  — > 
Cl  Cl  Cl  Cl  Cl 

Cl  Ci  >0  )$  «-< 
Cl  Cl  Cl  Cl  Cl 


ci  ci  ci  cc  a 
i  -  cr.  <  o  co  — 
Cl  Cl  Cl  Cl  Cl 

Cl  -H  JO  to  o 
o  o  o  c  o 

Cl  Cl  Cl  Cl  Cl 

>o  o  r — p  io 

-H  —  o  o  o 

Cl  Cl  Cl  Cl  Cl 


et-To 

ooo 

Cl  Cl  Cl 
O  M  M  e 

o  o  o  o 

Cl  Cl  Cl  Cl 


n  C  N  M  -  Ci  t  o  N  ccc 

-  co  o  e  iq  i-  cc  rc  Ci  ci 

IO  >0  to  io  ‘O  to  1.0  to  to  to 


o  rc  ci  o  ^  cc  Ci 
5  M»OCN  O  t-o 

to  -O'  '~P  T  ”P  -P  ~P 


Cl  Cl  oc  o 
e  *t  ci  e  i- 
to  to  to  to  to 


1.0  io>  t'o  1-0 


O  Cl  N  ~  CO  CO  O  I  -  —  i  o  Cl  Cl 
“  *  O  »0  to  C  N  M  M  Cl 
~  to  to  to  to  to 


o  to  o  Cl  H  rc  ci  ci  «o  to  co  c  cc  -  e  00  ;2  £ 


O  -P  - ,  . 

to  *0  to  to 

CO  ■ — 1  CC  O  Cl 
to  cc  CO  O  CO 
T  to  to  to  IO 


c.  CC  CO'  I  -  1^  o  c 

to  uo  IO  to  to  *'*' 

CO  ci  to  -H 

oc  co  c  a  »o 

to  to  to  to  *o> 


to  *o  to  -  .  .. . 
io  to  to  »o  to 

MM  -H  Oca 
o  e  e  ri  c. 
to  to  to  to  -p 

•  C.  ?1  C  l>-  Cl  C  Cl  cc 

I  —  O'-  C  c  cc  I'  O  to 

:  to  *r  to  »o  -r  *r  t  -r 


cr  o  -p  ©  o 
CC  I-  I-  cc  o 


CO  O 


rc  cr.  m  ri  -  ~  ~ - rc  ci  o  —  cc  i-  ci  >o  — <  ; 

^p  ci  ci  ci  ci  ci  ci  ci  ci  ci  ci  ci  ci  ci  ci  c-i  ci  ci  > 


-  b  co  i-  06 

rc  rc  ci  ci  ci 

-h  cc  m  o  e 

O  -P  1—  I  —  o 

Cl  Cl  Cl  Cl  Cl 

c  O  CO  CO  Cl 
C  C.  C  C  N 
rc  ci  Cl  ci  ci 


i-  c  cc  or  rc 

to  CO  CO  I - 

Cl  Cl  Cl  Cl  Cl 


w  M  C  CO 
ci  rc  rc  -r 
Cl  Cl  Cl  Cl 


rc  Ci  -p  o  co  co  io 

O  ci  o  o  o  ooo 

Cl  -H  Cl  Cl  Cl  Cl  Cl  Cl 


ci  1.0  -•HN 

-  e  ci  o  c 

Cl  Cl  Cl  Cl 


Cl  Cl  Cl 

I  —  CC  -p  to  o 
tc  *0  1.0  to  — 
Cl  Cl  Cl  Cl  Cl 


-  c  e  ci  m 

«v»  rsj  rvj  —j 

ci  Cl  Cl  Cl  Cl 


Cl  Cl  Cl  Cl  Cl 

Cl  Cl  — <  O  Cl 
Cl  Cl  Cl  —  — 
Cl  Cl  Cl  Cl  Cl 


ci  >o  co 

—  o  o 

Cl  Cl  Cl 

O  O  OC  00 

—  —  o  o 

Cl  Cl  Cl  Cl 


to  to  »o  to  *o  ”p  *r  r 

Cl  I- 


7  Cl  I'- 
-  I  -  I  - 

7  rc  ro 

rc  i-  ci  ci 
to  io  ’ 


i-  -r  -  o  co 

O  I-  l~-  O  'O 

M  MM  M  M 


o  o  co  -p  o 
Ci  w  rc  —  ci 
cc  rc  rc  rc  cc 


o  ci  >c  -r  rc 
o  co  o  to  a 
rc  cc  cc  rc  rc 


:  — I  to  O  O 
'  cc  cc  cc  cc 


rc  to  CC  CC  CO 
to  to  »o  -r  rc 
rc  cc  cc  rc  cc 


rc  cc  cc  rc  cc 
to  CC  to  -P  o 


CO  cc  o  o  o 
CC  T  cc  rc  CC 

cc  cc  cc  rc  cc 

CO  — 4  h  O  Cl 

I  —  I  —  I—  w  O 

rc  rc  cc  cc  cc 

to  to  o  Cl  o 

Cl  Cl  Cl  Cl  Cl 

cc  cc  rc  cc  cc 


0  0*0 
Cl  Cl  Cl 

cc  cc  cc 

CO  — I  -P 
to  rc  ci 
cc  cc  cc 


•  to  c  n  co  c  : 


to  CO 
to  Cl 
t—  to 


Cl  -p  to  to 

Cl  Cl  Cl  Cl 

co  rc  cc  rc 


to  Cl 
>o  I- 
CO  to 


G5  £ 


TABLE  1  (Cont,.) 

Weight- Validities  for  Four  Methods  and  Five  Criteria 
(Vo  =  30) 


! 


GEORGE  R.  BURKET 


35 


CO 

Ch  Ol 


bC  Ol 

c 

K 


5  »0  CO  CO 

•  co  co 


1  1^-  Ol  I'  W  cc  c  ' 


O  C.  i*  iC 
Oi  Ol  Ol  Ol  Ol 


•  O  ©  ©  GO  F-  CO  -+ 
Ol  Ol  01  01  Ol  Ol  Ol 


I  I  I  I  I  I  I  I  I  III 


i!0  Ol  Ol  CO  ©  CO  c 


IO  l'-  I-  -T  © 
CO  Cr'  'O  '*'1  o 
Ol  Ol  01  Ol  — . 


oo  —  55 

Ol  CO  Ol 


00  01 
~r  co 
01  01 


:  00  ©  ( 

:■  -o  to  i 

1  Ol  Ol  ( 


O  O  -  O  Ol 
i-O  Ol  0C  ©  I- 
Ol  Ol  i 


Ol  Ol  CO  CO  : 


;  oi  cc 

•  oi  — 

f  ©  o 


O  O  01  o  «o 

1— I  o  ©  CO  '"'l 

c  ©  ©  ©  o 

I  I  I 


•  CO  L0  -0  — ' 


o  o  o  © 

Mil 


5  -f  ?  ^ 

Cl  Is*  CO  O  *f 
CC  C  CO  *T  c 
Ol  Ol  Ol  CO  CO 

IOLO'7'1'-  — 
O  I-  1C  O  CO 
CO  CO  CO  CO  CO 


I  I  I 

•rccos 


TT  77  77  7 


I  I  I  I 

1.0  X  — '  —  CC  — <  »C 


C  N  CO  N  CC 
Ol  «  C  -H  o 
CO  CO  CO  CO  Ol 


f  -I  Ifl  00  N 
©  I — r  ©  i- 
CO  01  01  01  — < 

co  -o  ©  >o  oi 

o.  CC  00  CO  oc 
01  Ol  01  Ol  Ol 


CO  01  01 
■  ^  GO  1-0 


o  ©  lo  —  »o 

Ol  01  »-0  —  — i 

Cl  -  cc  r 


N  ~  Cl  C  C  O  —  ^0  -o  ^0  CO  Ol  ^  ~  *0  -t-  ^o  CO  CO  Ol 

Loiolo^to  ic  c  o  c  c  ©  ©  ©  ©  ©  ©  ©  ©  S  c 

Mil  II  II  I  II  I  I  I 

N  Ol  t  SO  CO  »0  -r  ©  F-  00  CO  -T  CO  C  —  F-  C/D  CO  F- 
I-  co  c  C  CC  C  K  N  C.  C  ~  ©  1-0  1-0  CO  Ol  ©  GO  CO  I- 

to  »0  1-0  1-0  1-0  IO  lO  1-0  1C  iO  1-0  1-0  >C  »C  >C  ic  -T  -y  -T  -t< 

h  h  o  I'*  «  -*•  -t  O  — '  O  ©  CO  CO  CO  >C  C  CC  *f  -  N 

O  IO  X  Cl  CO  1—  — <  1^  F-  LO  C  C  O  -  o  C  C  CC  CO  c 

to  t  "M-C  t  i-C  to  -r  o  o  -  -r  -r  cc  co  :o  co  oi  oi 

O  CO  CC  <©  CO  O  N  01  t  C  Ol  1^  O-  O  I-  -  CC  1C  N  - 

-r  -r  — 1  i-o  — ■  oi  co  oo  cc  c  o  CO'  n  ^  c  oi  r.  o  -r 

—  1.0  1C  1-0  IO  1-0  -t  -r  -y  -T  CO  CO  CO  co  CO  01 - — 

N  C.  w  O  01  j:  o  c  p  o.  Ol  —  CX  Ol  ©  ©  CO  ©.  ©  F- 

C0C0--O.  wwwwC  "I  ~  ~  ~ 

—  co  CO  CO  c  ■*  N  l»  C  !•*  X  CO  1C  CO  OC  I"  l'*  I  —  01 

©  01  Ol  T  CO  !>•©©-*•  — - —  w-  I-  CO  X  X  N  c 

co  -r  —  CO  CO  CO  CO  CO  CO  CO  CO  01  Ol  Ol  01  01  —  01 

K  C  Cl  N  to  i-  ^  O  CC  CO  -  X  C  «  ©■  —““!•»  CC  © 

-r~Nwl-  -  I.C  c  X  I-  N  K  i-C  CO  Cl  co  cr  co  —  y. 

CO  co  Ol  Ol  Ol  Ol  01  Ol  Ol  Ol  Ol  Ol  Ol  Ol  Ol  Cl  -  -"H  c 

s-xco  x  o  r  c  oi  2  ©!  9£  7*  ^  -r 

co  co  co  co  oi  —  —  —  —  See  ©  ©  ©  ©  c 

I  I 


I  I  I 


—  oi  oi  . 


o  c  o 

I  I  I 


01  Ol  01 

©  co  © 


I  Mill 


o  1-t  © 

N  *f  x  cc 

01  —  Ol  — 

©  ©  ©  o 

1111 


Ol  01  Ol  —  — 

"  g  s  = 


I  I 


©  © 
I  I 


CO  CO  — 
CO  O!  01 
IO  iO 


■  LO  —  CO  LO 

;  ©  ©  oi  ©  oi 

-  ‘  O  CO  ©  ©  © 
>  ©  ©  ©  ©  © 


II  I  I  I  II 


©  CO  —r 
01  -4  © 


I  I  I  II  II 


*  ~  ©  © 
I  I 


II 


999  999  999  999  999 

003  5S1  093  505  G15 


36 


REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION 


six  original-sample  sizes  used,  ranging  from  255  down  to  30  eases.  This  size 
is  denoted  by  Ar0.  In  caeli  instance,  the  new  sample  eontained  252  cases. 
An  original  sample  and  a  new  sample  were  independently  drawn  for  eaeh 
size  and  eaeh  eritcrion,  for  a  total  of  30  original  samples  and  30  new  samples. 
Since  for  rank  29,  all  methods  are  equivalent  (aside  from  rounding  error), 
the  corresponding  weight-validity  is  listed  only  under  method  1.  The  full- 
rank  (rank  29)  multiple  correlations  for  eaeh  sample  are  also  listed  under 
method  1,  the  subscripts  0  and  1  denoting  the  original  and  new  samples, 
respectively. 

Although  the  weight -validities  using  method  4  were  computed  on  the 
basis  of  the  data  given  above,  they  are  not  presented.  For  all  ranks,  eriteria, 
and  sample  sizes,  these  weight -validities  were  substantially  lower  than  those 
for  any  other  method  or  for  the  full-rank  weights.  They  were  frequently 
negative,  rarely  greater  than  .10,  and  virtually  always  less  than  half  as  large 
as  the  weight -validities  obtained  by  any  of  the  other  methods.  Our  expecta¬ 
tion  that  the  method  of  smallest  prineipal-axes  faetors  would  give  less  aeeurate 
predictions  than  the  other  methods  is  thus  unequivocally  confirmed. 

To  assist  in  comparing  the  other  four  redueed-ranlc  methods,  Table  2 
was  prepared  from  Table  1.  For  eaeh  original-sample  size  and  eaeh  criterion, 
two  comparisons  are  made.  In  eaeh  of  the  first  five  columns,  the  number  of 
ranks  for  whieh  eaeh  method  was  superior  to  the  other  three  methods  is  given. 
In  making  the  eounts,  ties  were  divided  equally  among  the  methods  sharing 
the  high  value  for  a  particular  rank.  In  eaeh  of  the  seeond  five  eolumns  of 
Table  2,  the  number  of  ranks  for  whieh  a  particular  method  was  superior  to 
the  full-rank  weights  is  given.  When  for  a  particular  rank  a  method  had  the 
same  weight-validity  as  the  full-rank  weights,  the  count  was  increased  by  one 
half. 

Of  the  four  methods,  the  method  of  largest  prineipal-axes  factors  most 
often  gave  the  highest  weight-validities  in  26  of  the  30  samples.  This  trend 
was  most  marked  when  the  weights  were  eomputed  on  smaller  samples, 
particularly  samples  of  size  30.  The  only  exceptions  oeeurred  for  samples  of 
210  and  255  eases.  The  superiority  of  method  3  was  most  pronouneed  for 
Psychology  and  Mathematics  and  less  elear-eut  for  English  Composition  and 
Chemistry.  Method  3  was  also  more  often  superior  to  the  full-rank  weights 
than  were  the  other  methods.  Thus  it  appears  that  our  expectation  as  to  the 
superiority  of  method  3  is  also  eonfirmed,  but  with  the  qualification  that, 
for  larger  samples  and  for  eertain  criterion  variables,  one  or  more  of  the  other 
methods  may  be  preferable. 

Another  possible  basis  of  comparison  would  be  the  number  of  samples 
for  whieh  a  particular  method  gave  the  highest  weight-validity  for  any  rank. 
Of  the  30  samples,  method  3  gave  the  highest  validity  in  12.5,  method  5  in 
8.5,  method  1  in  5,  and  method  2  in  4  samples.  The  comparisons  of  Table  2 
would  appear  to  be  more  meaningful  than  this  comparison,  however,  sinee 
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REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION 


the  outcome  of  the  latter  would  presumably  be  much  more  subject  to  random 
variability  of  weight-validities  from  rank  to  rank. 

In  Table  3  are  presented  data  from  ten  additional  original  samples  from 
the  criterion-1  (All-University)  population,  with  sizes  ranging  from  435  down 
to  30  cases.  Here  all  sets  of  weights  from  each  original  sample  were  eross- 
validateu  on  two  new  samples,  where  again  each  new  sample  consisted  of 
252  cases.  Total  squared  errors  of  prediction  are  presented  as  well  as  weight- 
validities  for  each  of  the  20  new  samples.  Method  4  was  omitted  from  this 
phase  of  the  computations.  At  the  bottom  of  each  page  of  Table  3  are  given, 
in  addition  to  the  original  sample  size  N0,  the  full-rank  multiple  correlations 
for  the  three  samples  represented  by  that  page;  these  are  denoted  by  7?„, 
and  R2  for  the  original  sample,  first  new  sample,  and  second  new  sample, 
respectively. 

Since  the  criterion  variable  (as  well  as  the  predictors)  was  normalized 
before  the  computations  were  earned  out,  the  total  squared  errors  of  pre¬ 
diction  are  comparable  from  sample  to  sample  as  well  as  from  method  to 
method  and  rank  to  rank.  Expressed  in  normal  deviates,  the  criterion  mean 
is  zero  and  the  sum  of  squares  is  one.  Thus  if  a  prediction  of  zero  were  made 
for  each  case,  without  ever  going  to  the  trouble  of  computing  regression 
weights,  the  total  squared  errors  of  prediction  would  bo  one.  Since,  for 
example,  the  total  squared  errors  of  prediction  using  the  full-rank  weights 
from  an  original  sample  of  size  75  are  greater  than  one  in  both  new  samples, 
it  appears  that  this  particular  regression  equation  is  worse  than  useless.  Yet 
for  this  same  sample  the  rank-1  errors  for  method  3  of  .737  and  .055  are 
actually  lower  than  either  of  the  full-rank  errors  obtained  for  the  sample 
of  390  cases,  which  were  .767  and  .745.  In  general,  it  may  be  seen  that  the 
lower-rank  errors  obtained  with  method  3  using  small  original  samples  com¬ 
pare  favorably,  or  at  least  not  unfavorably,  with  the  full-rank  errors  obtained 
using  large  original  samples.  A  similar  trend  may  be  noted,  though  not  so 
clearly,  with  regard  to  weight-validities. 

Table  4  was  prepared  from  Table  3  in  a  manner  analogous  to  the  prepara¬ 
tion  of  Table  2  from  Table  1.  Here,  of  course,  only  one  criterion  variable 
is  involved,  and  the  comparisons  are  made  with  respect  to  total  squared 
errors  of  prediction  as  well  as  to  weight-validities.  For  the  larger  original- 
sample  sizes,  the  outcomes  of  the  comparisons  arc  not  appreciably  affected  by 
the  index  of  accuracy  used.  For  the  smaller  sizes,  however,  the  total  squared 
errors  of  prediction  tond  to  favor  method  3  over  the  other  methods  o.nd  the 
lower  ranks  over  the  higher  to  a  greater  extent  than  do  the  weight- validities. 
In  the  present  series  of  samples,  just  as  in  the  preceding  series,  method  3 
appears  to  be  definitely  superior  to  the  other  methods.  And  even  for  the 
largest  original-sample  sizes,  method  3  appears  preferable  to  the  full-rank 
system. 

It  appears  that  method  3  could  be  used  to  considerable  advantage  in 
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TAIilJ  4 

Comparison  Between  Four  Reduced-Rank  Methods  With  Respect  to  Weight- Validities 
and  Total  Squared  Errors  of  Prediction  for  a  Single  Criterion 


Number  of  ranks  for  which  Number  of  ranks  for  which 

index  is  superior  index  is  superior 

Sample  to  other  methods  to  full-rank  method 

Size  Methods  Index  ll'i  11'2  TI’i  <pi  Wi  ife 


1 

2.33 

2.33 

.25 

0. 

0.5 

6.5 

10.5 

11 

435 

2 

3.33 

3. S3 

.25 

0. 

3.5 

5. 

S  .5 

10 

3 

21.5 

21 . 

20.75 

27.5 

IS. 

20. 

27.5 

28 

5 

.83 

.S3 

.75 

.5 

0. 

0. 

17.5 

10.5 

1 

1.33 

1 . 

0. 

0. 

S. 

7.5 

2. 

o 

390 

2 

2.33 

2 

.33 

.5 

IS. 5 

IS. 5 

0 . 5 

S. 

3 

19. 

20.5 

14.33 

15.5 

24. 

24.5 

20.5 

22.5 

5 

5.33 

4.5 

13.33 

12. 

10. 

17. 

24. 

25. 

1 

.S3 

.5 

2.5 

1 .5 

12. 

10. 

20.5 

24. 

345 

2 

3.83 

3.5 

3.5 

4. 

10. 

9.5 

20. 

22. 

3 

20.83 

21.5 

20.5 

22 

IS. 

21. 

27. 

27. 

5 

2.5 

2.5 

1  5 

.5 

5. 

4. 

20.5 

21.5 

1 

1 . 

1 . 

2 

2. 

6.5 

5 . 

6. 

6. 

300 

o 

0. 

0. 

3. 

3. 

12.5 

12. 

11.5 

11.5 

3 

24. 

24 

18. 

IS. 

27. 

27. 

20. 

20. 

5 

3. 

3. 

5. 

5. 

20.5 

20.5 

10. 

10.5 

1 

1. 

1 . 

2.5 

2 

11.5 

13.5 

16.5 

14. 

255 

2 

2 

2 

10.5 

S. 

13. 

14.5 

19.5 

20.5 

3 

23. 

23. 

4. 

0.5 

24. 

24. 

8. 

21. 

5 

2 

2 

11. 

11.5 

14.5 

14.5 

21.5 

27. 

1 

.33 

.33 

2. 

1  5 

3. 

5.5 

S. 

13.5 

210 

2 

.33 

1.33 

2. 

1.5 

5.5 

0.5 

9.5 

14.5 

3 

21 

22. 

21.5 

22.5 

24. 

24. 

25. 

26. 

5 

6.33 

4.33 

2.5 

2.5 

18.5 

21.5 

1  1.5 

14 

1 

4.33 

4.5 

0. 

0. 

7 . 

8.5 

IS. 5 

20.5 

105 

2 

3.  S3 

5.5 

i. 

1. 

4. 

6.5 

19.5 

24. 

3 

11.5 

14.5 

20  5 

26.5 

15. 

21. 

27. 

27. 

5 

8.33 

3.5 

.5 

.5 

22.5 

22. 

6.5 

S. 

1 

1 

1. 

1.5 

0. 

15. 

10.5 

19.5 

20. 

120 

2 

0. 

0. 

2.5 

2 

11. 

13. 

14.5 

10. 

3 

20.5 

20.5 

22.5 

24.5 

27. 

27. 

24. 

20. 

5 

.5 

.5 

1.5 

1.5 

16. 

17.5 

23.5 

25.5 

1  5.33  0.  0.  0.  18.  27.5  20.  23.5 

2  3  33  1  150  17  5  27  28.  2G.5 

3  18.5  2G.5  20.  27.5  20.5  27.  28.  28. 

5  .83  .5  .5  .5  12.  25.  2S.  20.5 


0 

0. 

0. 

0. 

27. 

28. 

27. 

2S 

9. 

0. 

2. 

0. 

28. 

2S . 

28. 

28 

17.5 

20.5 

25. 

26.5 

28. 

28. 

2S. 

28 

1.5 

1.5 

1. 

1 .5 

25.5 

25. 

24 

20 

30 
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either  of  two  situations.  The  first  would  be  where,  for  a  given  original-sample 
size,  one  wanted  the  greatest  accuracy  of  prediction  obtainable.  The  other 
would  be  where,  for  a  given  accuracy  of  prediction,  one  wanted  to  use  the 
smallest  possible  original  sample.  In  order  actually  to  compute  the  coefficients 
for  a  reduced-rank  prediction  equation,  however,  one  has,  of  course,  to  select 
the  particular  rank  to  be  used.  To  provide  some  indication  as  to  how  satis¬ 
factory  the  statistics  11'  and  $  would  be  for  this  purpose,  they  are  computed 
for  the  original  samples  of  Table  3  using  (40)  and  (90),  respectively.  They 
were  computed  only  for  method  3,  since  the  other  methods  are  dependent 
on  the  criterion  observations  for  order  of  selection,  contrary  to  the  assump¬ 
tions  used  in  deriving  the  above  statistics.  These  estimated  values  for  weight- 
validities  and  total  squared  errors  of  prediction  arc  given  in  Table  5.  To 
facilitate  comparisons,  the  obtained  values  from  Table  3  are  reproduced  in 
the  adjacent  columns.  At  the  bottom  of  each  page  arc  given  the  original- 
sample  size  and  the  full-rank  multiple  correlations  for  the  two  cross-valida¬ 
tion  samples.  The  multiple  correlation  and  the  estimated  population  correla¬ 
tion,  from  (32),  in  the  original  sample  arc  given  for  each  rank.  The  column 
headed  <2  is  an  estimate  of  the  standard  error  of  and  may  be  derived  as 
follows.  We  let  a  be  a  column  vector  composed  of  the  elements  of  z2  and  z3 
in  (91).  Then  we  may  write 

„„n  ?  N  +  L  , 

(135)  =  -N  _  L  a'a, 

where  the  elements  a;  of  a  arc  independently  distributed  with  mean  zero  and 
variance  a.  The  variance  of  a'a  will  be 

(130)  Var  (a'a)  =  E[(a'aY}  -  [. E(a'a)]2. 

Under  the  reduced-rank  hypothesis,  a'a  will  be  simply  the  error  sum  of 
squares  in  the  original  sample,  so  that  from  (71),  the  second  term  on  the 
right  of  (130)  will  be 

(137)  [E(a’a)]2  =  [(A  -  L)<r2.2  =  (N  -  L)V. 

Expanding  the  first  term  on  the  right  of  (130),  we  obtain 

(138)  £[(a'a)2]  =  (iV  -  L)E(a ‘)  +  (N  -  L)(N  —  L  —  1  )A(a-a2),  i  ^  j. 

•Since  the  o-,  are  inHpppnrlon f  wp  hnvp 

(139)  £(a2aj)  =  A(a2)A(a2)  =  </,  i  *  j. 

If  the  elements  of  the  criterion  vector,  y,  arc  assumed  to  be  normally  dis¬ 
tributed,  the  elements  of  a,  being  linear  combinations  of  the  criterion  observa¬ 
tions,  will  also  be  normally  distributed.  Thus  we  have  (Cramdr,  194G,  p.  212): 

(140) 


E(a 0  =  3  <A 
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TABLE  5 

Estimated  and  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 


OT 


PJ 


It  0 

Rc 

ci 

IF 

IF, 

IF. 

* 

Pi 

Pi 

1 

539 

53S 

048 

530 

582 

4SS 

712 

003 

763 

2 

519 

540 

048 

543 

590 

487 

705 

047 

704 

3 

519 

545 

04S 

540 

596 

4S7 

70S 

047 

705 

4 

550 

544 

048 

53S 

599 

491 

711 

013 

700 

5 

55S 

551 

048 

543 

003 

499 

705 

G3S 

753 

G 

559 

550 

048 

542 

008 

503 

707 

033 

749 

7 

5GS 

558 

04S 

548 

019 

513 

700 

019 

738 

S 

50S 

550 

048 

545 

020 

513 

703 

017 

738 

0 

508 

555 

048 

543 

020 

515 

700 

017 

737 

10 

568 

554 

049 

540 

020 

510 

709 

CIS 

736 

11 

571 

555 

049 

540 

013 

514 

709 

025 

73S 

12 

571 

554 

019 

537 

013 

514 

712 

025 

738 

13 

571 

552 

049 

534 

013 

514 

710 

025 

738 

14 

571 

551 

019 

532 

014 

511 

71S 

024 

741 

15 

578 

557 

049 

536 

013 

507 

714 

025 

747 

10 

5S3 

501 

019 

539 

017 

518 

711 

020 

734 

17 

584 

501 

049 

538 

017 

514 

713 

019 

739 

18 

590 

5GG 

049 

543 

024 

525 

70S 

011 

729 

10 

593 

50S 

019 

543 

022 

520 

707 

013 

728 

20 

594 

567 

049 

512 

029 

521 

709 

004 

734 

21 

009 

5S2 

048 

556 

014 

491 

693 

023 

770 

22 

011 

5S3 

048 

557 

019 

500 

693 

017 

707 

23 

015 

5S0 

048 

55S 

017 

496 

091 

020 

774 

24 

017 

5S7 

048 

558 

021 

492 

692 

010 

780 

25 

019 

588 

018 

558 

015 

4S8 

092 

023 

787 

20 

019 

587 

04S 

556 

015 

480 

695 

024 

789 

27 

022 

589 

04  S 

557 

010 

47S 

094 

030 

797 

28 

025 

590 

04S 

55S 

60S 

472 

093 

033 

805 

29 

026 

591 

049 

557 

013 

172 

094 

020 

SOO 

Aro  =  435 

R;  =  084 

R,  =  582 

Decimal  point  preceding  each  entry  has  been  omitted. 


Ranks 


52 


REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION 


TABLE  5  (Cont.) 

Estimated  and  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 


It  o 

Rc 

& 

IF 

IF. 

IF, 

i? 

\p2 

1 

545 

544 

051 

542 

481 

502 

706 

770 

749 

2 

554 

550 

050 

547 

507 

516 

701 

743 

734 

3 

554 

549 

050 

544 

506 

514 

704 

744 

736 

4 

562 

555 

050 

549 

51S 

535 

699 

732 

714 

5 

562 

55-4 

0.50 

546 

519 

535 

702 

731 

714 

6 

508 

559 

050 

550 

530 

54  S 

69S 

720 

700 

7 

571 

560 

0,50 

550 

519 

533 

69S 

731 

716 

8 

578 

565 

050 

553 

518 

535 

694 

733 

715 

9 

585 

571 

050 

558 

518 

516 

689 

733 

737 

10 

586 

571 

050 

556 

516 

514 

692 

737 

740 

11 

587 

571 

050 

555 

514 

508 

693 

740 

747 

12 

587 

569 

051 

552 

517 

510 

697 

736 

746 

13 

588 

568 

051 

549 

518 

510 

700 

730 

746 

14 

588 

567 

051 

546 

518 

509 

703 

736 

746 

15 

591 

569 

051 

547 

522 

518 

703 

731 

738 

1G 

592 

56S 

051 

545 

516 

523 

705 

73S 

732 

17 

595 

570 

051 

546 

528 

536 

704 

724 

717 

18 

605 

579 

051 

554 

522 

542 

695 

730 

713 

19 

607 

579 

051 

553 

525 

540 

697 

728 

715 

20 

610 

5S2 

051 

554 

502 

535 

696 

755 

722 

21 

611 

581 

052 

552 

495 

533 

699 

763 

723 

22 

611 

579 

052 

550 

497 

532 

702 

761 

725 

23 

614 

582 

052 

551 

496 

530 

701 

762 

727 

24 

615 

581 

052 

549 

497 

525 

703 

761 

733 

25 

619 

583 

052 

550 

496 

515 

702 

766 

745 

2G 

619 

5S2 

052 

54S 

495 

516 

705 

767 

743 

27 

619 

581 

052 

545 

492 

517 

709 

771 

743 

28 

619 

579 

053 

542 

491 

516 

712 

772 

743 

29 

619 

578 

053 

539 

495 

515 

715 

767 

745 

A'o  =  390 

Ri  =  646 

R,  =  638 

Ranks 
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TABLE  5  (Cont.) 

Estimated  and  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 


rq 

Re 

& 

w 

IF. 

TF, 

1? 

^1 

^2 

1 

59S 

596 

049 

595 

511 

531 

61G 

747 

723 

2 

601 

598 

049 

595 

524 

535 

040 

732 

71S 

3 

605 

GOO 

049 

596 

518 

530 

645 

741 

724 

4 

622 

GIG 

048 

G10 

516 

524 

62S 

753 

735 

5 

625 

6 18 

018 

Gil 

523 

530 

627 

742 

727 

G 

627 

018 

048 

010 

527 

536 

029 

735 

720 

7 

628 

G18 

048 

60S 

530 

536 

630 

■731 

721 

8 

630 

G18 

049 

607 

534 

535 

632 

72G 

722 

9 

630 

G17 

049 

601 

535 

535 

636 

726 

722 

10 

633 

019 

049 

605 

537 

532 

035 

724 

727 

11 

634 

619 

049 

603 

536 

531 

637 

727 

730 

12 

641 

624 

049 

608 

534 

513 

031 

735 

756 

13 

643 

G25 

049 

607 

531 

506 

633 

738 

768 

14' 

643 

G23 

049 

004 

530 

505 

630 

739 

768 

15 

652 

631 

049 

612 

519 

516 

628 

716 

759 

16 

651 

G33 

049 

012 

516 

507 

027 

722 

771 

17 

658 

G35 

049 

613 

543 

510 

020 

726 

766 

18 

659 

G35 

049 

Oil 

541 

513 

628 

728 

762 

19 

659 

G33 

049 

009 

511 

512 

632 

728 

763 

20 

661 

G34 

049 

609 

553 

519 

032 

712 

752 

21 

664 

636 

019 

609 

547 

521 

032 

719 

750 

22 

606 

637 

050 

610 

550 

518 

632 

716 

756 

23 

666 

030 

050 

607 

552 

519 

035 

712 

755 

24 

668 

637 

050 

607 

518 

51S 

630 

717 

757 

25 

673 

041 

050 

610 

535 

509 

632 

732 

769 

26 

673 

039 

050 

007 

535 

509 

636 

732 

769 

27 

674 

639 

050 

606 

535 

503 

038 

732 

778 

28 

675 

039 

051 

604 

532 

497 

040 

737 

788 

29 

67G 

638 

051 

602 

533 

502 

042 

736 

782 

j\ 

t,  =  345 

II 

o 

4— 

R .  =  630 

Ranks 
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REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION 


TABLE  5  (Cont  ) 

Estimated  ami  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 

AJo  A’c  a  I V  ii  i  B'2  ^  1^2 

501  542  702  091  710 


1 

493 

490 

002 

4S7 

0 

524 

519 

000 

515 

3 

524 

517 

000 

511 

4 

525 

510 

001 

50G 

5 

552 

541 

059 

531 

0 

553 

540 

059 

528 

7 

553 

538 

0G0 

523 

S 

559 

542 

000 

525 

9 

559 

540 

000 

521 

10 

503 

542 

000 

521 

11 

504 

540 

001 

518 

12 

500 

540 

001 

515 

13 

50S 

540 

001 

513 

14 

508 

538 

002 

510 

15 

577 

545 

002 

510 

10 

579 

540 

062 

515 

17 

583 

548 

002 

515 

18 

590 

o54 

002 

520 

19 

593 

554 

002 

519 

20 

593 

553 

002 

515 

21 

595 

553 

003 

513 

22 

595 

550 

0G3 

509 

23 

590 

549 

004 

500 

24 

598 

549 

004 

504 

25 

598 

540 

005 

500 

20 

004 

552 

004 

504 

27 

GOO 

552 

005 

503 

2S 

007 

550 

005 

499 

29 

00s 

549 

Vo  =  300 

000 

490 

550 

545 

735 

092 

704 

557 

548 

740 

091 

701 

55S 

540 

744 

089 

702 

504 

540 

719 

GS2 

70S 

500 

539 

722 

080 

710 

502 

540 

727 

0S5 

709 

500 

554 

725 

080 

094 

500 

554 

730 

080 

094 

547 

505 

730 

791 

082 

551 

500 

734 

097 

081 

548 

509 

737 

700 

077 

547 

500 

739 

701 

081 

551 

503 

743 

090 

083 

514 

571 

738 

704 

075 

540 

574 

739 

702 

071 

547 

580 

739 

701 

005 

508 

585 

735 

077 

059 

500 

578 

730 

080 

000 

557 

575 

741 

090 

070 

554 

571 

743 

094 

074 

553 

573 

748 

094 

072 

553 

570 

752 

094 

075 

54S 

572 

755 

701 

073 

547 

572 

700 

702 

073 

525 

559 

755 

720 

088 

520 

551 

758 

721 

097 

529 

510 

702 

722 

703 

527 

550 

700 

723 

098 

=  004 

=  004 
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TABLE  5  (Cont.) 

Estimated  and  Obtained  Measures  of  Accurac3  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 


«o 

Rc 

a 

IK 

IK, 

IK* 

^ 1 

^2 

1 

559 

557 

001 

555 

501 

473 

692 

GS7 

7S0 

2 

593 

5S8 

05S 

584 

592 

443 

659 

050 

820 

3 

593 

5S7 

059 

5S0 

590 

143 

663 

G52 

820 

4 

593 

5S5 

059 

576 

590 

443 

6G9 

652 

819 

5 

595 

5S4 

0G0 

573 

597 

446 

672 

644 

817 

0 

59G 

5S3 

000 

570 

596 

439 

676 

645 

825 

7 

599 

5S3 

061 

56S 

612 

446 

678 

G27 

817 

8 

599 

5S1 

001 

564 

611 

446 

683 

627 

818 

9 

001 

581 

002 

562 

612 

440 

686 

G26 

821 

10 

001 

579 

062 

557 

012 

447 

G91 

626 

821 

11 

001 

577 

0G3 

553 

G14 

449 

696 

624 

818 

12 

G02 

57G 

063 

550 

60S 

441 

700 

631 

827 

13 

G04 

575 

064 

547 

607 

436 

704 

633 

832 

14 

005 

574 

064 

545 

616 

435 

707 

622 

834 

15 

G07 

573 

065 

542 

012 

43S 

711 

025 

832 

10 

G08 

572 

065 

539 

008 

441 

714 

630 

830 

17 

G12 

575 

065 

540 

OOS 

435 

714 

G31 

S36 

IS 

01S 

579 

065 

542 

620 

412 

711 

01G 

828 

19 

623 

582 

065 

544 

002 

431 

710 

638 

841 

20 

623 

5S0 

006 

540 

600 

435 

716 

640 

840 

21 

626 

581 

066 

539 

599 

435 

717 

042 

839 

22 

639 

593 

005 

551 

GOO 

443 

704 

G40 

833 

23 

G40 

593 

065 

549 

5S9 

449 

707 

054 

828 

24 

641 

591 

060 

545 

591 

449 

712 

652 

829 

25 

614 

592 

066 

545 

5S1 

452 

713 

6G7 

S28 

20 

G45 

592 

067 

513 

573 

457 

710 

67G 

821 

27 

645 

590 

007 

538 

572 

457 

722 

67S 

822 

2S 

046 

587 

oos 

534 

509 

456 

727 

GS2 

S23 

29 

G4G 

580 

069 

531 

576 

44S 

732 

G73 

832 

iV0  =  225 

f’l  =  71 

17 

Ri  =  503 

Ranks 
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REDUCED  RANK  MODELS  TOR  MULTIPLE  PREDICTION 


TABLE  5  (Gout.) 

Estimated  and  Obtained  Measures  of  Aeeuraey  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 


R  o 

Ro 

<2 

if 

IFi 

ir2 

'P 

Pi 

4-i 

1 

52S 

525 

071 

522 

498 

571 

728 

753 

676 

2 

537 

530 

071 

524 

506 

566 

726 

745 

OSO 

3 

53  S 

528 

072 

519 

507 

563 

731 

744 

684 

4 

53S 

525 

072 

512 

50S 

563 

73S 

743 

683 

5 

540 

531 

072 

515 

49S 

565 

736 

754 

681 

6 

583 

506 

009 

550 

525 

572 

699 

727 

673 

7 

001 

582 

067 

564 

519 

570 

683 

740 

677 

8 

007 

5S6 

007 

566 

522 

576 

682 

736 

670 

9 

007 

5S3 

008 

561 

521 

575 

68S 

737 

671 

10 

COS 

5S1 

009 

556 

522 

573 

694 

735 

673 

11 

009 

580 

070 

552 

521 

571 

698 

737 

077 

12 

Oil 

579 

070 

549 

526 

509 

702 

732 

679 

13 

010 

581 

070 

549 

520 

552 

703 

740 

701 

14 

610 

579 

071 

544 

519 

551 

710 

743 

699 

15 

032 

594 

070 

558 

509 

560 

694 

702 

695 

10 

033 

593 

071 

555 

514 

561 

098 

750 

695 

17 

639 

590 

071 

557 

500 

554 

696 

774 

705 

18 

047 

003 

070 

562 

499 

539 

691 

773 

725 

19 

047 

601 

071 

55S 

499 

538 

697 

774 

727 

20 

017 

59S 

072 

553 

499 

540 

704 

773 

725 

21 

651 

600 

072 

553 

501 

532 

704 

700 

732 

22 

053 

599 

073 

550 

507 

542 

708 

762 

720 

23 

053 

597 

074 

515 

506 

542 

715 

704 

722 

24 

058 

599 

074 

510 

500 

542 

714 

775 

724 

25 

000 

600 

074 

545 

4S8 

539 

716 

795 

730 

20 

664 

002 

074 

546 

490 

523 

716 

794 

754 

27 

605 

000 

075 

541 

486 

519 

722 

798 

759 

28 

005 

597 

070 

536 

486 

520 

729 

797 

758 

29 

670 

000 

076 

538 

493 

524 

728 

789 

757 

N  o  =  210 

7? ,  =  605 

lit  —  653 

It  a  nka 
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TABLE  5  (Cont.) 

Estimated  and  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 


R<t 

R' 

W 

W, 

Wi 

1? 

^  1 

'i/i 

1 

544 

540 

078 

536 

587 

540 

712 

65S 

709 

2 

544 

536 

079 

52S 

588 

541 

721 

657 

708 

3 

549 

537 

OSO 

525 

581 

513 

725 

665 

705 

4 

563 

54S 

079 

533 

591 

537 

716 

652 

712 

5 

5S2 

564 

07  S 

546 

579 

539 

703 

665 

710 

6 

590 

569 

07  S 

54S 

578 

534 

701 

666 

716 

7 

593 

569 

079 

545 

5S2 

547 

705 

661 

702 

S 

60S 

5S1 

078 

555 

5S4 

537 

695 

660 

716 

9 

618 

5S8 

078 

560 

573 

538 

690 

676 

717 

10 

61S 

5S5 

079 

553 

574 

541 

698 

675 

714 

11 

639 

605 

077 

573 

563 

527 

677 

696 

736 

12 

645 

609 

077 

574 

557 

522 

675 

707 

743 

13 

615 

606 

078 

56S 

559 

523 

683 

704 

741 

14 

646 

003 

079 

562 

555 

522 

691 

710 

744 

15 

64S 

601 

OSO 

55S 

555 

529 

697 

712 

735 

1G 

64S 

598 

081 

552 

554 

529 

705 

713 

735 

17 

648 

594 

082 

515 

550 

528 

713 

719 

736 

IS 

649 

591 

084 

539 

550 

529 

721 

719 

735 

19 

649 

588 

0S5 

533 

552 

531 

729 

717 

729 

20 

650 

5S5 

0S6 

527 

55S 

53S 

737 

70S 

722 

21 

651 

5S3 

0S7 

522 

550 

536 

744 

721 

725 

22 

657 

586 

087 

523 

543 

532 

744 

733 

731 

23 

657 

582 

089 

516 

543 

532 

753 

733 

731 

24 

65S 

580 

090 

511 

544 

527 

761 

732 

741 

25 

659 

578 

091 

506 

539 

528 

767 

740 

740 

26 

659 

573 

093 

499 

539 

528 

777 

740 

740 

27 

659 

569 

094 

492 

53  S 

529 

787 

741 

738 

28 

665 

573 

091 

494 

551 

499 

7S6 

727 

778 

29 

666 

570 

096 

4S7 

555 

502 

794 

723 

777 

A'0  =  165 

/?,  =  679 

R,  =  646 

Ranks 
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REDUCED  R  ANK  MODELS  FOR  MULTIPLE  PREDICTION 


TAISLE  5  (Cent.) 

Estimated  and  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal-Axes  Factors 


Ro 

Re 

a 

11' 

I  IT 

IPs 

<h 

1 

551 

540 

001 

543 

540 

514 

705 

703 

73S 

5S2 

572 

08S 

503 

530 

543 

084 

71S 

700 

3 

5S2 

508 

090 

553 

530 

545 

005 

718 

704 

4 

5S2 

503 

002 

543 

534 

545 

700 

720 

704 

5 

582 

557 

094 

533 

534 

545 

71S 

721 

704 

G 

507 

508 

003 

540 

541 

535 

711 

71.3 

717 

7 

598 

503 

005 

531 

542 

533 

723 

712 

720 

S 

G07 

509 

000 

533 

521 

501 

721 

743 

759 

0 

G37 

508 

092 

501 

512 

505 

001 

754 

752 

10 

G38 

504 

004 

553 

510 

504 

701 

750 

753 

11 

047 

000 

091 

550 

509 

490 

009 

759 

770 

12 

047 

505 

000 

547 

509 

400 

711 

759 

770 

13 

047 

590 

098 

537 

500 

400 

723 

759 

770 

14 

000 

001 

007 

547 

520 

503 

713 

752 

75S 

15 

600 

590 

099 

538 

522 

502 

725 

749 

700 

10 

074 

000 

008 

549 

500 

4, SO 

714 

77 5 

700 

17 

G7S 

008 

099 

540 

100 

487 

720 

780 

783 

IS 

083 

010 

100 

544 

479 

479 

723 

810 

795 

10 

GS3 

005 

102 

530 

479 

473 

734 

810 

802 

20 

GOO 

022 

100 

553 

484 

474 

710 

817 

810 

21 

GOD 

017 

102 

544 

485 

473 

728 

810 

812 

22 

703 

017 

101 

541 

475 

4,80 

733 

835 

800 

23 

712 

024 

103 

547 

478 

454 

72S 

838 

852 

24 

713 

022 

105 

541 

482 

450 

730 

833 

802 

25 

725 

033 

104 

553 

400 

432 

724 

803 

805 

20 

720 

030 

100 

540 

402 

42S 

734 

S70 

905 

27 

735 

038 

105 

554 

44S 

434 

720 

004 

921 

28 

737 

035 

107 

548 

411 

440 

735 

917 

917 

29 

737 

030 

110 

539 

443 

441 

74S 

014 

010 

Aro  =  120 

Ri  =  038 

R,  =  042 

Ranks 
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TABLE  5  (Cont.) 

Estimated  and  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal- Axes  Factors 


Ho 

H 

a 

If' 

It', 

1(2 

1? 

<k 

1 

520 

510 

122 

501 

513 

592 

749 

737 

055 

2 

53G 

517 

123 

499 

197 

501 

752 

753 

050 

3 

5G3 

537 

122 

512 

4S5 

500 

710 

70S 

oso 

4 

G04 

573 

1 17 

544 

482 

5S0 

707 

783 

OGO 

C 

GOG 

507 

121 

531 

484 

58S 

724 

782 

057 

0 

G15 

509 

122 

527 

491 

591 

730 

774 

051 

7 

634 

584 

122 

537 

450 

570 

721 

S22 

0S4 

8 

G35 

570 

120 

522 

450 

508 

740 

830 

GS7 

9 

G35 

507 

130 

500 

451 

508 

700 

830 

080 

10 

G37 

501 

134 

494 

458 

509 

777 

820 

080 

11 

035 

575 

134 

505 

111 

557 

707 

847 

703 

12 

GG1 

57-1 

130 

499 

432 

500 

777 

803 

095 

13 

689 

004 

132 

529 

■420 

500 

7-15 

900 

710 

14 

7G3 

098 

10S 

03S 

388 

532 

000 

103-1 

792 

15 

703 

092 

112 

027 

3SS 

533 

020 

1034 

791 

1G 

707 

091 

115 

022 

389 

520 

034 

1030 

S10 

17 

797 

727 

105 

003 

400 

537 

578 

1027 

799 

18 

797 

722 

109 

053 

400 

530 

59-1 

1027 

801 

19 

798 

710 

113 

013 

404 

540 

Oil 

1023 

797 

20 

799 

712 

117 

034 

110 

543 

024 

1013 

791 

21 

799 

700 

121 

G24 

41 1 

543 

042 

1011 

790 

22 

807 

712 

121 

629 

390 

540 

037 

1084 

813 

23 

818 

723 

120 

040 

390 

514 

023 

1101 

809 

24 

820 

731 

120 

G  46 

3SI 

510 

015 

ini 

872 

25 

827 

725 

124 

030 

370 

512 

633 

1123 

883 

2G 

850 

758 

113 

070 

3S1 

504 

573 

122S 

984 

27 

S50 

752 

11S 

GOO 

384 

504 

590 

1230 

9S9 

28 

S50 

740 

123 

055 

385 

505 

009 

1220 

988 

2D 

854 

748 

125 

055 

3S9 

491 

Oil 

1251 

1030 

Ar0  =  75 

It.  =  008 

R.  =  084 
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REDUCED  RANK  MODELS  FOR  MULTIPLE  PREDICTION 


TABLE  5  (Cont.) 

Estimated  and  Obtained  Measures  of  Accuracy  of  Prediction  Using 
Method  of  Largest  Principal- Axes  Factors 


Rq 

Jthc 

d 

}V 

Wi 

W, 

if 

fa 

fa 

1 

393 

573 

176 

555 

429 

411 

694 

817 

838 

2 

593 

552 

191 

514 

429 

411 

742 

817 

S37 

3 

662 

613 

180 

567 

503 

545 

687 

754 

709 

4 

681 

617 

187 

560 

492 

537 

701 

766 

718 

5 

690 

610 

199 

539 

467 

529 

733 

791 

725 

6 

722 

634 

199 

557 

426 

479 

718 

836 

779 

7 

732 

628 

211 

538 

412 

473 

747 

850 

786 

8 

744 

626 

222 

526 

424 

477 

770 

816 

790 

9 

759 

629 

232 

520 

432 

509 

786 

852 

758 

10 

803 

683 

215 

581 

347 

413 

712 

9S4 

909 

11 

823 

701 

215 

597 

336 

442 

695 

1015 

865 

12 

824 

682 

237 

564 

337 

440 

750 

1012 

867 

13 

830 

671 

256 

542 

317 

427 

789 

1036 

878 

14 

837 

661 

275 

523 

321 

427 

825 

1045 

887 

IS 

843 

650 

297 

501 

279 

403 

866 

1117 

938 

16 

845 

623 

332 

459 

270 

400 

938 

1126 

943 

17 

848 

592 

372 

413 

264 

419 

1018 

1152 

933 

18 

853 

566 

411 

375 

257 

387 

108S 

1181 

980 

19 

872 

589 

418 

398 

305 

447 

1067 

1169 

957 

20 

906 

082 

364 

513 

222 

411 

S92 

1547 

1169 

21 

911 

660 

409 

478 

196 

391 

959 

1648 

1237 

22 

915 

625 

473 

420 

198 

376 

1057 

170S 

1326 

23 

952 

773 

336 

627 

138 

279 

712 

2427 

2056 

24 

964 

805 

317 

672 

164 

274 

634 

3041 

2310 

25 

972 

820 

321 

692 

143 

287 

600 

4042 

3290 

26 

978 

824 

345 

694 

134 

215 

598 

4934 

3873 

27 

988 

875 

281 

775 

209 

164 

445 

6529 

5785 

28 

995 

916 

219 

844 

196 

092 

310 

9073 

8519 

29 

999 

975 

081 

951 

-085 

-077 

099 

* 

* 

> 

II 

CO 

o 

Ri  =  619 

I?2  =  07 

2 

Value  greater  than  ten. 
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Flitting  (139)  nnd  (140)  in  (138),  wc  obtain 

(141)  77[(a'a)2]  =  (N  -  L)(N  -  L  +  2>4. 

Then,  putting  (141)  and  (137)  in  (136),  we  may  write 


(142)  Var  (a' a)  =  2(N  -  L)a\ 

From  (135)  the  variance  of  Tp  will  be 

,  _  2 (N  +  L)V 


(143) 


N  -  L 


For  an  unbiased  estimate  of  a  we  use  (141)  and  (95)  to  obtain 


(144) 


2(1  -  R?y(N  +  L)2 
(N  -  L)2(N  -  L  +  2) 


The  values  for  &  given  in  Table  5  were  computed  from  the  square  root  of  (144). 

In  discussing  Table  5,  we  will  consider  first  the  1C  new  samples  corre¬ 
sponding  to  the  original-sample  sizes  of  120  and  up.  With  a  few  exceptions, 
the  estimated  errors  of  prediction  did  not  differ  from  the  obtained  values  by 
more  than  one  or  two  times  the  standard  error  of  the  estimate.  In  the  full- 
rank  case,  for  example,  the  difference  between  \p  and  $  was  less  than  a  in 
eight  samples,  between  a  and  2 &  in  six  samples,  and  between  2<2  and  3a  in 
two  samples.  Ten  of  the  obtained  values  fell  above  the  estimated  and  six 
fell  below.  Estimates  for  the  lower  ranks  tended  to  be  more  accurate.  The 
weight-validities  and  their  estimates  evidently  were  less  variable  than  the 
errors  of  prediction.  Though  no  estimate  of  the  standard  error  of  TF  is  avail¬ 
able,  its  accuracy  is  apparently  comparable  to  that  of  i p.  Taking  iuto  con¬ 
sideration  the  variability  of  the  obtained  measures  of  accuracy,  both  statistics 
appear  to  be  fairly  good  estimates  of  the  corresponding  expected  values, 
though  their  standard  errors  are  rather  larger  than  one  could  wish. 

Of  perhaps  more  significance  than  the  absolute  magnitudes  of  the 
expected  values  for  \p  and  W  are  the  relative  magnitudes  from  one  rank  to 
another.  As  a  rough  indication  of  how  feasible  it  would  be  to  base  the  choice 
of  the  rank  to  be  used  on  we  may  compare  the  values  of  \p  corresponding  to 
the  rank  for  which  \p  wras  smallest  with  the  full-rank  \p.  Again  considering 
only  the  16  new  samples  corresponding  to  the  original-sample  size  of  120  and 
above,  we  see  that  in  15  of  the  16  instances,  the  reduced-rank  weights  so 
chosen  gave  more  accurate  predictions  than  did  the  full-rank  weights.  Some 
of  these  improvements  were,  of  course,  veiy  small.  For  example,  in  only  8 
of  the  16  new  samples  was  the  reduction  in  total  squared  errors  of  prediction 
as  large  as  4  per  cent.  The  largest  reductions  were  22.9  per  cent  and  21.4  per 
cent,  both  for  weights  from  the  original  sample  of  120  cases.  Just  how  large 
the  reduction  would  have  to  be  to  attain  practical  significance  is,  of  course, 
debatable. 
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In  an  effort  to  evaluate  the  success  of  \p  as  an  indicator  of  the  rank 
corresponding  to  the  lowest  expected  error  of  prediction,  two  comparisons 
were  made.  First,  it  would  seem  reasonable  to  require  that  the  total  squared 
errors  of  prediction  for  the  selected  rank  be  closer  to  the  lowest  value  obtained 
in  a  given  sample  than  to  the  highest.  This  is  the  case,  however,  in  only  9  of 
the  16  samples.  A  second  comparison,  intended  to  control  for  variability  in 
the  obtained  errors  of  prediction,  was  made  on  the  basis  of  the  rank  orders 
(from  lowest  to  highest)  of  these  values  in  the  individual  samples.  For  each 
member  of  each  pair  of  samples  corresponding  to  a  particular  original  sample, 
the  rank  corresponding  to  rank-order  1  was  determined.  The  rank  order  in 
the  opposite  member  of  the  pair  of  the  error  of  prediction  corresponding  to 
the  optimal  rank  in  the  first  member  was  then  obtained.  The  average  of 
these  16  rank  orders  was  7.4,  suggesting  a  fair  degree  of  stability  in  optimal 
rank.  In  contrast  to  this  value,  the  average  rank  order  of  the  errors  of  pre¬ 
diction  corresponding  to  the  select'd  ranks  was  12.4.  Since,  if  the  ranks  had 
been  selected  at  random,  the  expected  rank  order  would  be  15,  it  appears 
that  i p  does  not  provide  a  satisfactory  basis  for  selection.  However,  a  better 
basis  docs  not  appear  to  be  available. 

We  consider  now  the  results  of  Table  5  for  the  original-sample  sizes  of  75 
and  30.  For  the  higher  ranks,  both  estimates  appear  to  break  down  com¬ 
pletely.  For  the  lower  ranks,  taking  into  account  the  large  standard  errors, 
the  two  estimates  appear  to  do  about  as  well  as  in  the  larger  samples.  Because 
of  these  large  standard  errors,  however,  $  and  It-'  arc  not  very  helpful  as 
guides  to  the  absolute  magnitude  of  the  corresponding  expected  values.  If 
taken  as  an  aid  to  judgment  rather  than  as  an  index  to  be  applied  blindly, 
\p  in  particular  might  be  of  value  in  arriving  at  an  optimal  rank.  In  the  original 
sample  of  size  30,  the  lowest  value  of  $  for  ranks  below  24  occurred  for  rank  3. 
Very  little  judgment  is  required  to  select  a  rank-3  solution  in  preference  to  a 
solution  of  rank  24  or  more  on  a  sample  of  30  cases.  As  it;  turned  out,  the 
optimal  rank  was  in  fact  3  in  both  cross-validation  samples.  In  the  original 
sample  of  size  75,  the  alternative  to  a  rank-1  solution  would  be  one  of  rank  14 
or  more.  For  samples  of  75  cases  an  optimal  rank  of  14  is  certainly  possible, 
though  unlikely.  In  any  event,  it  appears  that,  providing  unrealistically  low 
values  for  higher  ranks  are  ignored,  ^  is  potentially  of  some  value  in  deciding 
what  rank  to  use  for  small  samples  as  well  as  for  large  ones. 

It  will  Ire  recalled  that,  in  deriving  and  IF,  the  assumption  was  made 
that  the  factor  loadings  of  the  predictor  matrix  would  be  constant  from  sample 


these  statistics  may  bo  due  to 


the  failure  to  take  sampling  variation  of  the  factor  loadings  into  account. 


This,  of  course,  could  not  have  been  done  within  the  context  of  regression 


theory,  since  there  only  the  criterion  variable  is  considered  random.  The 


regression  model  was  selected  for  this  study  largely  on  the  basis  of  its  sim¬ 
plicity,  but  also  on  the  grounds  that  it  is  the  model  generally  used  in  con- 
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ncctiou  with  prediction  problems.  However,  it  seems  likely  that  an  analysis 
of  prediction  problems  in  terms  of  the  multivariate  normal  model  of  correla¬ 
tion  theory  or  in  terms  of  some  other  model  where  the  predictor  variables 
arc  considered  random  would  lead  to  more  successful  estimates  of  accuracy 
of  prediction  than  those  obtained  using  regression  theory. 


OFT  AFTER  4 


SUMMARY  AND  CONCLUSIONS 

The  primary  concern  of  this  study  has  been  with  the  possibility  of  using 
reduced-rank  solutions  for  regression  weights  to  increase  the  accuracy  of 
prediction  obtainable  in  future  samples.  Using  regression  theory,  a  general 
factor  model  for  reduced-rank  prediction  was  developed.  It  was  shown  that, 
if  errors  in  the  criterion  observations  arc  not  to  be  capitalized  upon,  the 
optimal  basis  for  determining  a  lower-rank  solution  will  be  the  amount  of 
variance  accounted  for  in  the  predictor  data  matrix.  Thus  the  best  alternative 
to  reduced-rank  methods  that  seek  to  obtain  the  maximum  multiple  correla¬ 
tion  with  the  criterion  would  be  the  method  of  largest  principal-axes  factors, 
as  suggested  by  Horst  (1941).  Estimates  of  the  weight-validities  and  total 
squared  errors  of  prediction  to  be  expected  when  a  particular  set  of  weights 
is  applied  in  future  samples  were  also  derived. 

An  empirical  comparison  of  five  particular  reduced-rank  methods  was 
carried  out,  using  29  predictors  and  with  partial  replication  on  five  criteria, 
Weights  were  computed  on  samples  ranging  from  30  to  435  cases.  As  expected, 
the  method  of  largest  principal-axes  factors  was  markedly  superior  to  the 
other  methods  tested.  This  superiority  was  quite  general,  appearing  in  all 
samples  for  some  criteria,  and  in  some  samples  for  all  criteria.  The  above 
finding,  together  with  the  very  poor  showing  of  the  method  of  smallest 
principal-axes  factors,  supports  the  conclusion  regarding  the  importance  of 
predictor  variance  accounted  for  by  the  lower-rank  system.  The  fact  that 
the  largest  principal-axes  factors  tended  to  give  more  accurate  predictions 
than  d:d  the  principal-axes  factors  having  the  highest  multiple  correlation 
with  the  criterion  suggests  the  desirability  of  selecting  predictors  indepen¬ 
dently  of  the  criterion  observations.  The  exceptions  to  this  trend  for  the 
larger  original-sample  sizes  on  some  criteria  indicates  the  desirability  of 
developing  some  sort  of  statistical  test  for  deciding  when  the  predictor- 
selection  methods  using  the  criterion  observations  may  be  advantageously 
applied. 

Although  their  standard  errors  were  rather  large,  especiallv  in  small 
samples,  the  estimates  of  weight-validity  and  of  total  squared  errors  of  pre¬ 
diction  to  bo  expected  in  future  samples  appeared  to  be  reasonably  serviceable 
as  regards  absolute  magnitude.  As  to  relative  magnitude  from  one  rank  to 
another,  however,  it  may  be  questioned  whether  a  rank  chosen  on  the  basis 
of  these  estimates  would  be  preferable  to  a  rank  chosen  at  random.  As  esti¬ 
mates  of  either  absolute  or  relative  magnitude,  it  seems  likely  that  the 
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statistics  derived  here  could  be  substantially  improved  upon  if  variation  in 
the  predictor  variables  or  in  their  factor  loadings  were  taken  into  account. 
Without  such  improved  estimates,  the  large  potential  advantages  of  reduced- 
rank  methods  demonstrated  here  cannot  be  fully  realized.  Thus  it  would 
seem  well  worthwhile  to  undertake  an  analysis  of  prediction  problems  using 
a  statistical  model  which,  unlike  regression  theory,  treats  the  predictors  as 
random  variables. 

Until  more  efficient  methods  arc  developed,  it  is  suggested  that  a  regres¬ 
sion  equation  based  on  the  subset  of  largest  principal-axes  factors  for  which 
4  is  smallest  will  be  the  best  available.  For  samples  with  less  than,  say,  50 
degrees  of  freedom,  this  procedure  must  be  supplemented  by  a  subjective 
process  to  the  extent  of  ignoring  low  values  of  4  for  ranks  of  say,  ten  or  more. 
Although  this  procedure  leaves  considerable  room  for  improvement,  the 
relevant  evidence  seems  sufficiently  favorable  to  warrant  further  empirical 
research.  At  any  rate,  the  strong  possibility  has  been  raised  that  the  con¬ 
ventional  full-rank  weights  can  almost  always  be  improved  upon  even  in 
samples  of  several  hundred  cases.  Such  weights,  moreover,  may  give  predic¬ 
tions  only  slightly  more  accurate  than  those  made  from  weights  obtainable 
with  samples  of  as  ferv  as  30  cases. 
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